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PROSTATE SPErTFTr GENES ATVn TTTF 
USE THEREOF AS TARGETS FOR PROSTATE r ANrp-P 
THERAPY AND DIAGNOSIS 

FIELD OF THF. mvT^TvrrTrMvr 
The present invention relates to the identification of DNA sequaices that correspond to 
alternatively spUced events in genes expressed on the surface of prostate cancer cells. These 
g«ies or then: corresponding proteins are to be targeted for liie treatment, prevention and/or 
diagnosis of cancers wherein these genes are differentially regulated and/or spliced, 
particularly in prostate cancer. 



BACKGROUND OF THF INVENTION 

Genetic detection of human disease states is a rapidly developing field (T^arowsky et 
al., 1982; Slamon et a!., 1989; Sidransky et al., 1992; Mild et al., 1994; Dong et al., 1995; 
Morahanetal., 1996;Lifton, 1996; Barinaga, 1996). However, some problems exist ;^th this 
approach. A number of known genetic lesions merely predispose an individual to the 
development of specific disease states. Individuals carrying the genetic lesion may not develop 
the disease state, while other individuals may develop the disease state without possessing a 
particular genetic lesion. In human cancers, genetic defects may potentially occur in a large 
number of known tumor suppresser genes and proto-oncogenes. 

Genetic detection of cancer has a long history. Some of the earUest genetic lesions 
shown to predispose to cancer were transforming point mutations in the ras oncogenes 
(Taparowsky et al., 1982). Transforming ras point mutations maybe detected in the stool of 
individuals with benign and malignant colorectal tumors (Sidransky et al., 1992). However, 
only 50% of such tumors contained a ras mutation (Sidransky et al., 1992). Similar results 

havebeen obtained with ampUficationofHER-2/neuinbreastandprostatecancer(Slamonet 
al., 1989), deletion andmutation of p53inbladdercancer(Sidranskyetal., 1991), deletion of 
DCC in colorectal cancer (Fearon et al.. 1990) and mutation of BRCAl in breast and prostate 
cancer (MM et al., 1994). 

None of these genetic lesions are capable of predicting a majority of individuals with 
cancer and most require direct sampling of a suspected tumor, and make screening difficult. 
Further.noneofthemarkers described above are capableofdistinguishingbetweenmetastatic 
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and non-metastatic fonns of cancer. In effective management of cancer patients, identification 
of those individuals whose tumors have akeady metastasized or are likely to metastasize is 
critical. Because metastatic cancer kills 560.000 people in the U.S. each year (ACS home 

page), identificationofmarkersformetastaticprostatecancerwouldbe an important advance. 

A particular problem in cancer detection and diagnosis occurs with prostate cancer. 
Carcinoma of the prostate is the most frequentiy diagnosed cancer among men in the United 
States (Veltri et al., 1996). Prostate cancer was diagnosed in approximately 1 89,500 men in 
1998 and about 40,000 men succumbed to the malignancy (Landis et al, 1998). Although 
relatively few prostate tumors progress to clinical significance during the lifetime of the 
patient, those which are progressive in nature are likely to have metastasized by the time of 
detection. Survival rates for individuals withmetastaticprostatecancerarequitelow.Be^ 

these extremes arepatients with prostate tumors that will metastasizebuthavenotyet done so, 
for whom surgical prostate removal is curative. Determination of which group a patient folk 
within is critical in determining optimal treatment and patient survival. 

The FDA approval of the serum prostate specific antigen (PSA) test in 1984 changed 
thewaythatpiostatediseasewasmanaged(Allhoffetal., 1989; Cooneretal., 1990; Jacobson 
et al, 1995; Orozco et al., 1998). PSA is widely used as a serum biomarker to detect and 
monitor therapeutic response in prostate cancer patients (Badalament et al., 1 996; OT)owd et 
al., 1997). Several modifications in PSA assays (Partin and Oesterling, 1994; Babian et al., 
1996; Zlotta et al, 1997) have resulted in earlier diagnoses and improved treatment. 

AltiioughPSAhas been widelyusedasaclinicalmarkerofprostate cancer since 1988 
(Partin and Oesterling, 1994), screening programs utilizing PSA alone orin combination with 
digital rectal examination (DRE) have not been successfid in improving the survival rate for 
men with prostate cancer (Partin and Oesterling, 1 994). Although PSA is specific to prostate 
tissue, it is produced by normal and benign as well as malignant prostatic epithelium, resulting 
in a hifiji false-positive rate for prostate cancer detection (Partin and Oesterling, 1 994). 

While an effective indicator of prostate cancer when serum levels are relatively high, 
PSA serum levels are more ambiguous indicators of prostate cancer when only modestly 

elevated, for examplewhenlevelsarebetween2-10ng/ml.Atthesemodest elevations, semm 
PSA may have originated fi-om non-cancerous disease states such as BPH (benign prostatic 
hyperplasia), prostatitis or physical trauma (McCormack et al, 1 995). Altiiough appUcation of 
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the lower 2.0 ng/ml cancer detection cutoff concentration of serum PSA has increased the 
diagnosis of prostate cancer, especially in younger men with nonpalpable early stage tumors 
(Stage Tic) (Sohetal., 1997; Carter and Coffey, 1997; Harris etal., 1997;Oiozcoetal., 1998), 
the specificity of the PSA assay for prostate cancer detection at low serum PSA levels remains 
a problem. 

Several investigators have sought to improve upon the spedfidty of serologic detection 
of prostate cancer by examining a variety of other biomarkere besides serum PSA 
concentration (Ralph and Veltri, 1997). One of the most heavily investigated of these other 
biomaricers is the ratio of fi:ee versus total PSA (ft PSA) in a patient's blood. Most PSA in 
serum is in a molecular form that is bound to other proteins such as alphal -antichymotrypsin 
(ACT) or alpha2-macroglobulin (Christensson et al, 1993; Stenman et al., 1991; Lilja et al., 
1991). Free PSA is not bound to other proteins. The ratio of free to total PSA (i^tPSA) is 
usually significantiy higher in patients with BPH compared to those with organ confined 
prostate cancer (Marley et al., 1996; Oesterling et al., 1995; Pettersson et al., 1995). When an 
appropriate cutoff is determined for the fi'tPSA assay, the fi'tPSA assay can help distinguish 
patients with BPH firam those with prostate cancer in cases in which serum PSA levels are 
only modestly elevated (Marley etal., 1 996; Partin and Oesterling, 1996). Unfortunately, while 
CtPSA may improve on the detection of prostate cancer, information in the FtPSA ratio is 
insufficient to improve the sensitivity and specificity of serologic detection of prostate cancer 
to desirable levels. 

Other markers that have been used for prostate cancer detection include prostatic acid 
phosphatase (PAP) and prostate secreted protein (PSP). PAP is secreted by prostate cells under 
hormonal control (Brawn et al., 1996). Ithas less specificity and sensitivity than does PSA. As 
a result, it is used much less now, although PAP may still have some appUcations for 
monitoring metastatic patients that have feiled primary treatments. In general, PSP is a more 
sensitive biomarker than PAP, but is not as sensitive as PSA (Huang et al., 1993). Like PSA, 
PSP levels are firequently elevated in patients with BPH as well as tiiose witii prostate cancer. 

Another serum marker associated witii prostate disease is prostate specific membrane 
antigen (PSMA) (Horoszewicz et al., 1987; Carter and Coffey, 1996; Murphy et al., 1996). 
PSMA is a Type H cell membrane protein and has been identified as FoUc Acid Hydrolase 
(FAH) (Carter and Coffey, 1 996). Antibodies against PSMA react witii botii normal prostate 
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tissue and prostate cancer tissue (Horoszewicz et al., 1 987). Murphy et al. (1 995) used ELISA 
to detect serum PSMA in advanced prostate cancer. As a serum test, PSMA levels are a 
relatively poor indicator of prostate cancer. However, PSMA may have utiUty in certain 
circumstances. PSMA is expressed in metastatic prostate tumor capillary beds (Silver et al., 
1997) and is reported to be more abundant in thebloodofmetastatic cancer patients (Muiphy 
et al., 1996). PSMA messenger RNA (mRNA) is down-regulated 8-10 fold in the LNCaP 
prostate cancer ceU line after exposure to 5-alpha-dihydroxytestostetone (DHT) (Israeli et al., 
1994). 

Two relatively new potential biomarkers for prostate cancer are human kallekrein 2 
(HK2) (Piironen et al., 1 996) and prostate specific transglutaminase (pTGase) (Dubbink et al., 
1996). HK2 is a member of the kallekrein family that is secreted by the prostate gland 
(Piironen et al., 1996). Prostate specific transglutaminase is a calcium-dependent enzyme 
expressed in prostate cells that catalyzes post-translational cross-linking of proteins (Dubbink 
et al., 1996). In theory, serum concentrations of HK2 or pTGase maybe of utUity in prostate 
cancer detection or diagnosis, but the usefidness of these markers is still being evaluated. 

Interleukin 8 QL^S) has also been reported as amarker for prostate cancer. (Veltri et al., 
1999). Serum 11^8 concentrations were reported to be correlated with increasing stage of 
prostate cancer and to be capable of differentiating BPH from malignant prostate tumors. (Id.) 
The wide-scale applicability of this marker for prostate cancer detection and diagnosis is still 
under investigation. 

In addition to these protein markers for prostate cancer, several genetic changes have 
been reported to be associated with prostate cancer, including: allelic loss (Bova, et al., 1 993; 
Macoska et al.. 1994; Carter et al., 1990); DNA hypermethylation (Isaacs et al., 1994); point 
mutations or deletions of the retinoblastoma (Rb), p53 and KAIl genes (Bookstdn et al.. 
1990a; Bookstein et al., 1990b; Isaacs et al., 1991; Dong et al., 1995); and aneuploidy and 
ancusomy of chromosomes detectedby fluorescence in situ hybridization (FISH) (Macoska et 
al., 1994; Visakorpi et al., 1994; Takahashi et al., 1994; Alcaraz et al., 1994). None of these 
have been reported to exhibit sufficient sensitivity and specificity to be useful as general 
screening tools for asymptomatic prostate cancer. 

In current clinical practice, the serum PSA assay and digital rectal exam (DRE) is used 
to indicate which patients should have a prostate biopsy (Lithrup et al., 1994; Orozco et al.. 
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1998). Histological examination of the biopsied tissue is used to make the diagnosis of 
prostate cancer. Based upon the 1 89,500 cases of diagnosed prostate cancer in 1998 (Landis, 
1998) and aknown cancer detectionrate of about 35% (Parker et al.. 1996), it is estimated that 
in 1998 over one-half million prostate biopsies were performed in the United States(Oro2coet 
5 al., 1998; Veltrietal., 1998). Clearly, there would be much benefit derived from a serological 
test that was sensitive enougji to detect small and early stage prostate tumors that also had 
sufficient specificity to exclude a greater portion of patients with noncancerous or clinically 
insignificant conditions. 

There remain deficiencies in the prior art with respect to the identification of the genes 
10 linked with the progression of prostate cancer and the development of diagnostic methods to 
monitor disease progression. Likewise, the identification of genes, which are differentially 
expressed in prostate cancer, would be of considerable importance in the development of a 
rapid, inexpensive method to diagnose cancer. Although a few prostate specific genes have 
been cloned (PSA, PSMA, HK2, pTGase, etc.). these are typically not upregulated in prostate 
15 cancer. The identification of a novel, prostate specific gene that is differentially expressed in 
prostate cancer, compared to non-malignant prostate tissue, would represent a major, 
unexpected advance for the diagnosis, prognosis and treatinent of prostate cancer. 

The use of therapeutic antibodies for treatinent of cancers tiiat target surface proteins is 
known. Examples thereof include RITUXAN® that targets CD20 on B cell lymphoma, 
20 Campath® that targets a surface antigen CD52 expressed by chronic lymphocytic leukemia, 
Herceptin® tiiat targets erbB2 on breast and other cancers and Mybtara that targets CD33 
surface antigen expressed on leukemia cells. However, to date, a monoclonal antibody for 
tireatinent of prostate cancer has not been approved for therapeutic use. 

SUMMARY OF THE INVENTION 
The present invention relates to the identification of novel nucleic acid and amino acid 
sequences that are characteristic of prostate cancer cell or tissue, and which r^resent targets 
for therapy or diagnosis of such a condition in a subject 

The invention more specificaUy discloses 159 specific, isolated nucleic addmolecules 
that encode novel expression sequences. Of these, 122 are expressed sequence tags that are 
differentiaUyspUced and correspond to SEQ ID NOS 1-65,74,80,85, 102-134. 136,141, 146, 
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150-165, 167, 168. In addition, 42 specific isoforms of known genes have been identified 
corresponding to SEQ ID NOS. 67-72, 75-77, 81-83, 86-90. 92, 93, 95-98, 100, 101, 137-139, 
143, 144, 147-149, 169-173, 175, 177, 179, and 181. These novel sequences vJere found to be 

differentially expressed between normal prostate and prostate cancer. The expressed sequence 
tag represent novel exons that are alternatively spUced in prostate cancer, and as such, directly 
identify distinct isoforms. These sequences and molecules represent targets and valuable 
information to develop methods and materials for the detection, diagnosis, and treatment of 
prostate cancer. 



It is an object of the invention to provide methods and materials for treatment and 
diagnosis of prostate cancer. 

It is a more specific object of the invention to identify novel exons (novel spKce 
variants) that are expressed by prostate cancer tissue which are potential gene targets for 
treatment and diagnosis of prostate cancer. 

It is a specific object of the invention to develop novel therapies for treatonent of 
prostate cancer involving the administiation or use of anti-sense oUgonucleotides 
corresponding to novel gene targets that are specifically expressed by the prostate cancer. 

It is another specific object of the invention to identify exons and tiie corresponding 
protein domain encoded by those exons specifically upregulated in prostate cancer cells. 

It is another specific object of tiie invention to produce ligands that bind antigens 
encoded by the exons, expressed as a protein domain by certain prostate cancers, including, 
but not limited to, monoclonal antibodies. 

It is another specific object of the invention to provide novel therapeutic regimens for 

the tireatinentofprostate cancer that involve the administration oruseofantigensexpressedby 
certain prostate cancers, alone or in combination with adjuvants that elicit an antigen-specific 
cytotoxic T-cell lymphocyte response against cancer cells tiiat express such antigen. 

It is another object of the invention to provide novel therapeutic regimens for the 
treatonent of prostate cancer that involve the administration or use of Ugands, especiaUy 
monoclonal antibodies that specificaUy bind novel antigens that are expressed by certain 
prostate cancers. 
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It is an other object of this invention to provide phannaceutical compositions 
comprising a ligand or antigen as defined above, in combination with a phannaceuticaUy 
acceptable carrier or excipient and/or an adjuvant. 

Itisanotherobjectoftheinventiontoprovideanovelmethodfordiagnosisofprostate 
cancer by using ligands, e.g., monoclonal antibodies, which specifically bind to antigens that 

arespedficaUyexpressedbycertainprostate cancers, in order to detect whetherasubject has 
or is at increased risk of developing prostate cancer. 

It is another object of the invention to provide a novel method of detecting persons 
having, or at increased risk of developing prostate cancer by use of labeled DNAs that 
hybridize to novel gene targets expressed by certain prostate cancers. 

It is yet another object of the invention to provide diagnostic test kits for the detection 
of persons having or at increased risk of developing prostate cancer that comprise a Ugand, 
e.g., monoclonal antibody that specifically binds to an antigen expressed by prostate cancer 
ceUs, and a detectable label, e.g. indicator enzymes, a radiolabels, fluorophores, or 
paramagnetic particles. 

It is another object of the invention to provide diagnostic kits for detection of persons 
having or at risk of developing prostate cancer that comprise DNA primers or probes specific 
for novel gene targets specifically expressed by prostate cancer cells, and a detectable label, 
e.g. indicator enzymes, a radiolabels, fluorophores, or paramagnetic particles. 

It is another object of this invention to provide methods for selecting, identifying, 
screening, characterizing or optimizing biologically active compounds, comprising I 
determination of whether a candidate compound binds, preferably selectively, an antigen or a 
polynucleotide as disclosed in the present application. Such compounds represent drug 
candidates or leads for treating cancer diseases, particularly prostate cancer. 

It is another object ofthe invention to identify genes that are expressed in altered forms 
in prostate cancer cells. These forms represent splice variants of the gene, where the 
DATAS™ firagment either 1 ) indicates the splice event occurring within the gene, or 2) points 
to a gene that is actively spliced to produce different gene products. These different spUce 
variants or isoforms can be targets for therapeutic intervention. 
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LEGEND TO THE FIGURES 
^^^^ ^ • Expression of Sequence ID: No. 92 in nonnal human tissue. Primers were 
designed to detect the DATAS clone sequence and RT-PCR analysis was performed for 30 
cycles. Lane 1, Prostate; lane 2, Heart; lane 3, Lung; lane 4, Kidne3r, lane 5, Liver; lane 6, 
5 Brain; lane 7, Placenta; lane 8, Sk. Muscle; lane 9, Pancreas; lane 10, Spleen; lane ll] 
Thymus; lane 12, Testis; lane 13, Ovary, lane 14. Sm. Intestine; lane 15. Colon; lane 16 
Leukocyte. 

Fig5se2:Expression of clone(SEQ ID N092)in normal and tumorprostate samples. 
Primers were designed to detect the DATAS clone and RT-PCR analysis was performed for 40 
10 cycles. Individual RNA samples (normal and tumor) were tested both as pooled and as 
individual samples. The pooled RNA samples were used to produce cDNA using either an 
oligo dT approach (dT) or through a random primer protocol (RP). Individual patient cDNA 
samples (lanes 9-12) were pr^ared through the random primed protocol. Lane 1, prostate 
tumor pool 1 (RP cDNA); lane 2, normal prostate pool 1 (RP cDNA); lane 3. prostate tumor 
pool 2 (RP cDNA); lane 4, normal prostate pool 2 (RP cDNA); lane 5, prostate tumor pool 1 
(dT cDNA); lane 6, normal prostate pool 1 (dT cDNA); lane 7, normal prostate pool 2 (dT 
cDNA); lane 8, NTC; lane 9, Patient 1 (OHK); lane 10, Patient 2 (T523); lane 1 1, Patient 3 
(82B) ; lane 12, Patient 4 (4BK). 

AUgmnent of the different isofoims isolated from structural analysis of clone 
20 (DATAS clone number). The sequences isolated from the DATAS derived events were 
mapped using Blat against the Human genome to annotate the gene and determine the each 
unique spUcing event. Five events aremapped with AK092666, an EST that closely resembles 
the five events. 

Kgure_4 : Western blot analysis for the expression of STEAP2 isoforms. Protein 
25 extracts from prostate cancer ceU lines were separated on SDS_PAGE gels and transferred to 
nitrocellulose, and probed with an antibody raised against a peptide sequence present in the N- 
terminal portion of the wild type STEAP2 protein. Five different cell lines were analyzed: lane 
1) LNCaP; 2) 22Rvl 3) MDA-PCa2b; 4) PC3; 5) DU145. Tbte blot was developed using 
standard chemilxmiinescence reagents. 

30 
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DETAILED DE SCRIPTION OF THE INVENTION 
DATAS (Different Analysis of Transcripts with Alternative SpUcing) analyzes 
structural differences between expressed genes and provides systematic access to alterations 
5 in RNA splicing (disclosed in U.S. Patent No.6^51,590, the disclosure of which is 
incorporated by reference in its entirety). Having access to these spUced sequences, which are 
critical for cellular homeostasis, represents a usejfiil advance in functional genomics. 

The DATAS Technology generates two Ubraries when comparing two samples, such as 
10 normal vs. tumor tissue. Each library specifically contains clones of sequences that are present 
and more highly expressed in one sample. For example, library A will contain sequences that 
are present in genes in the normal samples but absent in the tumor samples. These sequences 
are identified as being removed or spUced out fix)m the genes in the tumor samples. In 
contrast, hT>raiy B will contain sequences that are present only in the tumor samples and not 
15 present in the normal samples. These represaitexons/introns that are alternatively spUced into 
genes expressed only in the tumor samples. 

The present invention is based in part on the identification of exons that are isolated 
using DATAS and then determined to be differentially regulated or expressed in prostate 

20 tumor samples. Specifically, 122 expressed sequence tags were identified through DATAS 
and confirmed to be differentially expressed between normal prostate tissue and prostate tumor 
tissue. These DATAS firagments (DF) are small sections of genes that are selected for 
inclusion or exclusion in one sample but not the other. These small sections are part of the 
expressed gene transcript, and can consist of sequences derived firom several different regions 

25 of the gene, including, but not limited to, portions of single exons, several exons, sequence 
from introns, and sequences firom exons and introns. This alternative usage of exons in 
different biological samples produces different gene products from the same gene through a 
process well known in the art as alternative RNA splicing. In particular, 37 alternatively 
spliced isoforms have been identified from the DATAS fragment sequences, and produce 

30 alternate gene products that fit all the descriptions of targets and gene products below. 
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Alternatively spliced mRNA's produced from the same gene contain different ribonucleotide 
sequence, and therefore translate into proteins with different amino acid sequences. Nucleic 

acid sequences that are alternatively spliced into or outofthe gene products can be inserted or 
deleted in frame or out of fi^e fi»m the original gene sequence. ITus leads to the translation 
of different proteins from each variant. Differences can include simple sequence deletions, or 
novel sequence information inserted into the gene product. Sequences inserted out of frame 
can lead to the production of an early stop codon and produce a truncated form of the protein. 
Alternatively, in-frame insertions of nucleic acid may cause an additional protein domain to be 
expressed from the mRNA. The end stage target is a novel protein containing either a novel 
epitope or function. Many variations ofknown genes have been identified and produce prot. 
variants that can be agonistic or antagonistic with the original biological activity of the prot. 



ein 
:eui. 



DATAS fragments thus identify genes and proteins which are subject to differential regulation 
and alternative spUcing(s) in prostate cancer cells. DATAS fragments thus allow the definition 
of target molecules suitable for diagnosis or therapy of prostate cancers, which target 
molecules comprise all or a portion of genes or RNAs comprising the sequence of a DATAS 

fi:agment,orof genes or RNAfrom which the sequenceofaDATAS fragment derives, as weU 
as corresponding polypeptides or proteins, and variants thereof 

A first type of target molecule is a target nucleic acid molecule comprising the sequ«ice of a 
Ml gene or RNA molecule comprising the sequence of a DATAS fragment as disclosed in the 
present appUcation. Indeed, since DATAS identifies genetic deregulations associated with 
prostate tumor, the whole gene or RNA sequence from which said DATAS fragment derives 
can be used as a target of therapeutic intervention or diagnosis. 

Similariy, another type of target molecule is a target polypeptide molecule comprising the 
sequence of a fiiU-length protein comprising the amino acid sequence encoded by a DATAS 
fiagment as disclosed in the present application. 

A fiirther type of target molecule is a target nucleic acid molecule comprising a fragment of a 
gene or RNA as disclosed above. Indeed, since DATAS identifies genes and RNAs tiiat are 
altered in prostate tumor cells, portions of such genes or RNAs, including portions that do not 
comprise the sequence of a DATAS fragment, can be used as a target for therapeutic 
intervention or diagnosis. Examples of such portions include : DATAS fragments, portions 



wo 2004/113571 



11 



PCT/IB2004/002394 



thereof, alternative exons or introns of said gene or RNA, exon-exon, exon-intron or intron- 
intron junction sequences generated by splicing(s) in said RNA, etc. Particular portions 
comprise a sequence encoding a exbra-cellular domain of a polypeptide. 
Similarly, another type of target molecule is a fragment of a protein comprising the amino acid 
sequence encoded by a DATAS fragment as disclosed in the present appUcation. Such 
fragments may comprise or not the DATAS sequence, and may comprise newly generated 
amino add sequence resulting, for instance, fix)m a frame shift, a novel exon-exon or exon- 
intron junction, tiie creation of new stop codon, etc. 

These target molecules (including genes, fragments, proteins and tiieir variants) can serve as 
diagnostic agents and as targets for tiie development of therapeutics. For example, these 
therapeutics may modulate biological processes associated with prostate tumor viabiUty. 
Agents may also be identified that are associated with tiie induction of apoptosis (cell death) in 
prostatetiimorcells. Other agents can also be developed, such as monoclonal antibodies, fliat 
bind to flie protein or its variant and alter the biological processes important for cell gro wtii. 
Alternatively, antibodies can deliver a toxin which can inhibit cell growtii and lead to cell 
death. 



Specifically, the invention provides sequences tiiat are expressed in a variant protein 
and are prostate hmior specific or prostate specific. These sequences are portions of genes 
identified to be in the plasma membrane of flie cell through bioinfonnatic analysis, and the 
specific sequences of the invention are expressed on the extiaceUular region of the protein, so 
that the sequences may be usefijl in the preparation of prostate tiimor vaccines, including 
prophylatic and therapeutic vaccines. 

Based thereon, it is anticipated that tiie disclosed genes tiiat are associated witii tiie 
differentially expressed sequences and tiie corresponding variant proteins should be suitable 
targets for prostate cancer tiierapy, prevention or diagnosis, e.g. for tiie development of 
antibodies, small molecular inhibitors, anti-sense tiierapeutics, and ribozymes. The potential 
flierapies are described in greater detail below. 

Such tiierapies wUl include tiie syntiiesis of oUgonucleotides having sequences in the 
antisense orientation relative to tiie subject nucleic acids which appear to be up-regulated in 
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prostate cancer. Suitable therapeutic antisense oligonucleotides will typically vary in length 
from two to several hundred nucleotides in length, more typically about 50-70 nucleotides in 
length or shorter. These antisense oligonucleotides may be administered as naked nucleic 
acids or in protected forms, e.g., encapsulated in Uposomes. The use of liposomal or oflier 
protected forms may be advantageous as it may enhance in vivo stabiUty and thus fadHtate 
delivery to target sites, i.e., prostate tumor cells. 

Also, the subject novel genes may be used to design novel ribozymes that target the 
cleavageofthecorrespondingmRNAsinprostatetumorcells. Similarly, these ribozymes may 
be administered in free (naked) form or by the use of delivery systems that enhance stabiUty 
and/or targeting, e.g., liposomes. 

Also, the present invention embraces the administration of use of nucleic adds that 
hybridize to the novd nucleic add targets identified infra, attached to therapeutic effector 
moieties, e.g., radiolabds, (e.g., 'V, '^'T) cytotoxins, cytotoxic enzymes, and the like in order 
to selectively target and kill cells that express these nucldc adds, i.e., prostate tumor cells. 

Also, the present invention embraces the treatment and/or diagnosis of prostate cancer 

by targetmg altered genes or the corresponding altered proteinparticularly splice variants th^ 
are expressed in altered form in prostate tumor cells. These methods will provide for the 
selective detection of cells and/or eradication of cells that express such altered forms thereby 
minimizing adverse effeds to normal cells. 

Still fiulher, the present mvention encompasses non-nucleic add based therapies. For 
example, the invention encompasses the use of a DNA containing one of the novd cDNAs 
correspondingto novd antigen identifiedheidn. It is antidpated that the antigens so encoded 
may be used as therapeutic or prophylactic anti-tumor vaccines. For example, a particular 
contemplated application of these antigens involves their administration with adjuvants that 
induce a cytotoxic T lymphot^e response. 

Administration of the subject novd antigens in combination with an adjuvant may 

result inahumoralimmuneresponse against such antigens, therebydelaying or preventingthe 
development of prostate cancer. 

These embodiments of the invention will comprise administration of one or more of 
the subjed novd prostate cancer antigens, ideally in combination with an adjuvant, e.g.. 
PROVAX™ (as disclosed U.S. Patents Nos. 5,709,860, 5,695,770, and 5,585,103, which 
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comprises a microfluidized adjuvant containing Squalene, Tween and Pluronic), ISCOM'S®. 
DETOX®, SAP, Freund's adjuvant. Alum®, Saponin®, among others. This composition will 
be administered in an amount sufficient to be therapeutically or prophylactically effective, e.g. 
on the order of 50 to 20,000 mg^g body weight, 100 to 5000 mg^g body weight 

Yet another embodimentof the invention will comprise thepreparationofmonoclonal 
antibodies against the antigens encoded by the novel genes containing the nucleic acid 
sequences disclosed infra. Such monoclonal antibodies may be produced by conventional 
methods and include human monoclonal antibodies, humanized monoclonal antibodies, 
chimeric monoclonal antibodies, single chain antibodies, e.g., scFv's and antigen-binding 
antibody fragments such as Fab and Fab' fragments. Methods for the preparation of 

monoclonal antibodies are known in the art. In general, preparationofmonoclonal antibodies 
will comprise immunization of an ^priate (non-homologous) host with the subject prostate 
cancer antigens, isolation of immune cells therefrom, use of such immune cells to isolate 

monoclondanta^odies and screeningformonoclonal antibodies that specificallybind to either 
of such antigens. Antibody fragments may be prepared by known methods, e.g., enzymatic 
change of monoclonal antibodies. 

These monoclonal antibodies and fragments will be useful for passive anti-tumor 
immunotherapy, or may be attached to therapeutic effector moieties, e.g., radiolabels, 
cytotoxins, therapeutic enzymes, agents that induce apoptosis, and the like in order to providi 
for targeted cytotoxicity, i.e., killing of human prostate tumor cells. CJiven the fact that the 
subject genes are apparently not significantly expressedbymanynoimal tissues this should not 
result in significant adverse side effects (toxicity to non-target tissues). 

In one embodiment, of the present invention such antibodies or fragments will be 
administered in labeled or unlabeled form, alone or in conjunction with other therapeutics, 
e.g., chemotherapeutics such as cisplatin, methotrexate, adriamycin. and the like suitable for 
prostate cancer therapy. The administered composition will also typically include a 
pharmaceutically acceptable carrier, and optionally adjuvants, stabilizers, etc., used in antibody 
compositions for therapeutic use. 

Preferably, the subject monoclonal antibodies wUl bind the target antigens with high 
affinity, e.g., possess a binding afBnity (Kd) on the order of lO"* to lO"'^ M. 
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As noted, the present invention also embraces diagnostic applications that provide for 
detection of the expression of prostate specific splice variants disclosed herein. This wiU 

comprise detecting the expressionofone or moreofthese genes at the RNAlevd and/or at the 
protein level. 

For nucleic acids, expression of the subject genes will be detected by known nucleic 
acid detection methods, e.g.. Northern blot hybridization, strand displacement amplification 
(SDA), catalytic hybridization amplification (CHA), and other known nucleic acid detection 
methods. Preferably, a cDNA library will be made from prostate cells obtained from a subject 
to be tested for prostate cancer by PGR using primers corresponding to the novel isoforms 
disclosed in this application. 

The presence or absence of prostate cancer can be determined based on whether PGR 
products are obtained, and the level of expression. The levels of expression of such PGR 
product may be quantified m order to determine the prognosis of a particular prostate cancer 
patient (as the levels of expression of the PGR product often will increase or decrease 
significantly as the disease progresses.) Thismay provide a method for monitoring the status 
of a prostate cancer patient. 

Alternatively, the status of a subject to be tested for prostate cancer may be evaluated 
by testing biological fluids, e.g., blood, urine, lymph, and the like with an antibody or 
antibodies or fragment that specifically binds to the novel prostate tumor antigens disclosed 
herein. 

Methods forusing antibodies to detect antigen expression are well known and include 
ELISA, competitive binding assays, and the like. Jn general, such assays use an antibody or 
antibody fragment lhat specifically binds the target antigen directly or indirectly bound to a 
label that provides for detection, e.g. indicator enzymes, a radiolabels, fluorophores. or 
paramagnetic particles. 

Patients which test positive for the enhanced presence of the antigen on prostate cells 
will be diagnosed as having or being at increased risk of developing prostate cancer. 

Additionally, the levels of antigen expression may be useful in detennining patient status, 
how far disease has advanced (stage of prostate cancer). 

As noted, thepresentinventionprovides novel spUce variants that encode antigens that 
correlate to humanprostate cancer. The present invention also embraces variants thereof. As 
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used herein "variants" means sequences that are at least about 75% identical thereto, more 
preferably at least about 85% identical, and most preferably at least 90% identical and still 
more preferably at least about 95-99% identified when these DNA sequences are compared to 
a nucleic acid sequence encoding the subject DNAs or a fragment thereof having a size of at 
least about 50 nucleotides. This includes allelic and splice variants ofthe subject genes. The 
present invention also encompasses nucleic acid sequences thathybridize to the subject spUce 
variants under high, moderate or low stringency conditions e.g., as described infra. 

Also, the present invention provides forprimer pairs thatresult in the amplification of 
DNAs encoding the subject novel genes or a portion thereof in an mRNA library obtained 
Scorn a desired cell source, typically human prostate cell or tissue sample. Typically, such 

primers will beontheorderof 12 to 50 nucleotides inlength, and will be constructed sudi that 
they provide for amplification ofthe entire or most ofthe target gene. 

Also, the invention embraces the antigens encoded by the subject DNAs or fragments 
thereof that bind to or elicits antibodies specific to the ftiU-length antigens. Typically, such 
fi:agments wiU be at least 1 0 amino adds in lengfli, more typically at least 25 amino adds in 
length. 

As noted, the subject DNA firagments are expressed in a majority of prostate tumor 
samples tested. The invention fiirfher contemplates the identification of other cancers that 
express such genes and the use thereof to detect and treat such cancers. For example, the 
subject DNA fragments or variants thereof may be expressed on other cancers, e.g., breast, 
ovary, pancreas, lung or prostate cancers. Essentially, the present invention embraces the 
detection of any cancer wherein the expression ofthe subject novel genes or variants thereof 
coirdate to a cancer or an increased likeUhood of cancer. To facilitate under-study ofthe 
invention, the following definitions are provided. 

"Isolated tumor antigen or tumor protein" refers to any protdu that is not in its normal 

cellularenvironment. This includes by way of examplecompositionscomprisingrecombinant 
proteins encodedby the genes disclosed mfra, pharmaceutical compositions comprising such 
purified proteins, diagnostic compositions comprising such purified proteins, and isolated 
protein compositions comprising such proteins, hipreferred embodiments, an isolated prostate 

tumor protein accordingto the invention will compriseasubstantiallypureprotein, in thatit is 
substantially free of other protdns, preferably that is at least 90% pure, that comprises the 
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amino acid sequence contained herein or natural homologues or mutants having essentially the 
same sequence. A naturally occurring mutant might be found, for instance, in tumor cells 
expressing a gene encoding a mutated protein according to the invention. 

••Native tumor antigen or tumor protein" referatoapioteinlhatisanon-hmnan primate 
5 homologue of the protem having the amino add sequence contained infra. 

"Isolated prostate tumor gene or nucleic acid sequence" refers to a nucleic add 
molecule that encodes a tumor antigen according to the mvention which is not in its normal 
human cellular environment, e.g., is not comprised in the human or non-human primate 
chromosomal DNA. This includes by way of example vectors that comprise a gene according 
10 to the invention, a probe that comprises a gene according to the invention, and a nucleic add 
sequence directly or indirectly attached to a detectable moiety, e.g. a fluorescent or radioactive 
label, or a DNA fusion that comprises a nucldc add molecule encoding a gene according to 
theinventionfused atits 5' or3' end to a different DNA, e.g. a promoter or a DNA encoding a 
detectablemarker or effector moiety. Also included are natural homologues ormutants having 
15 substantially the same sequence. Naturally occurring homologies that are degenerate would 
emcode the same protein includingnucleotide differences that do not change the corresponds^ 
amino add sequence. Naturally occurring mutants might be found in tumor cells, wherein 
sudi nucleotide differences may result in a mutant tumor antigen. Naturally occurring 
homologues containing conservative substitutions are also encompassed. 

"Variant of prostate tumor antigen or tumor protein" refers to a protein possessing an 
amino acid sequence that possess at least 90% sequence identity, more preferably at least 91% 
sequence identity, even more preferably at least 92% sequence identity, still more preferably at 
least 93% sequence identity, still more preferably at least 94% sequence identity, even more 
preferably at least 95% sequence identity, still more preferably at least 96% sequence identity, 
25 even more preferably at least 97% sequence identity, stiU more preferably at least 98% 
sequence identity, and most preferably at least 99% sequence identity, to the corresponding 
native tumor antigen wherdn sequence identity is as defined infra. Preferably, this variant will 
possess at least one biological property in common with the native protein. 

"Variant of prostate tumor gene or nudeic acid molecule or sequence" refers to a 
30 nuddc add sequence that possesses at least 90% sequence identity, more preferably at least 
91%, more preferably at least 92%, evenmore preferably atleast 93%. StiU more preferably at 
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least 94-/0. even more preferably at least 95o/o. still more preferably at least 96%, even more 
preferably at least 97o/o, even more preferably at least 9Z% sequence identity, and most 
preferably at least 99o/„ sequence identity, to the corresponding native human nucleic add 
sequence, wherein "sequence identity" is as defined infira. 

'Tragmentofpiostate antigen encoding nucleicacidmoleculeor sequence-refers toa 
nucleic add sequence corresponding to a portion of the native human gene wherein said 
portion is at least about 50 nucleotides in length, or 100, more preferably at least 150 
nucleotides in length. 

"Antigenic fragments ofprostate tumor antigen" refer to polypeptides corresponding to 
a fragment of a prostate protein or a variant or homologue thereof that when used itself or 
attached to an immunogenic carrier elicits antibodies that spedficaUy bind the protdn. 
Typically such antigenic fragments will be at least 8-15 amino acids in lengfli. and may be 
much longer. 

Sequence identity or percent identity is intended to mean the percentage of the same 

residues sharedbetweentwo sequences, referenced to humanprotdnAorprotdnBorgeneA 
or gene B. when the two sequences are aligned using the Clustal method [Higgins et al, Cabios 
8:189-191 (1992)] of multiple sequence alignment in the Lasergene biocomputing software 
(DNASTAR. INC, Madison, WI), or aligmnent programs avaUable from the Genetics 
Computer Group (GCG Wisconsin package, Accelrys, San Diego. CA). m this method, 
multiple aUgmnents are carried out in a progressive mamier. in which larger and large^ 
aligmnent groups are assembled using similarity scores calculated from a series of pairwise 
aligmnents. Optimal sequence aUgmnents are obtained by finding the maximum aligmnent 
score, which is the average of all scores between the separate residues in the aUgnment 
detemuned from a residue wdght table representing fee probability of a given amino add 
change occurring in two related protdns over a given evolutionary interval. Penalties for 

openingandlengtheninggapsin the aUgmnent contribute to thescore.Thedefaultparameters 
used wifli this pix>gram are as follows: gap penalty for multiple aligmnent=10; gap length 
penalty for multiple ahgmnent=10; k-tuple value in pairwise aligmnent=l; gap penalty in 
pairwise aligmnent=3; window value in pairwise aUgmnent=5; diagonals saved in pairwise 
aligmnent=5. The residue wdght table used for the aUgmnent program is PAM250 
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[Dayhofifet al., in Atlas of Protein Sequence and Structure, Dayhofi^ Ed, NDRF, Washington, 
Vol. 5, suppL 3, p. 345, (1978)]. 

Percent conservation is calculated from the above aUgmnent by adding the percentage 
of identical residues to the percentage of positions at which the two residues represent a 
conservative substitution (defined as having a log odds value of greater than or equal to 0.3 in 
the PAM250 residue weight table). Conservation is referenced to human Gene A or gene B 
when determining percent conservation with non-human Gene A or gene B. e.g. mgene A or 
gene B, when determining percent conservation. Conservative amino acid changes satisfying 
this requirement include: R-K; B-D, Y-F, L-M; V-I, Q-H. 

Polypeptide Fragments 

The invention provides polypeptide firagments of the disclosed proteins. Polypeptide 
fragments of the invention can comprise at least 8, more preferably at least 25, still more 
preferably at least 50 amino acid residues of the protein or an analogue thereof. More 
particularly such fragment will comprise at least 75, 100, 125, 150, 175, 200, 225, 250, 275 
residues of the polypeptide encoded by the corresponding gene. Even more preferably, the 
protein fragment wiU comprise the majority of the native protein, e.g. about 100 contiguous 
residues of the native protein. 

Biologically Active Variants 

The invention also encompasses mutants of the novel prostate proteins disclosed infra 
which comprise an amino acid sequence that is at least 80%, more preferably 90%, stUl more 
preferably 95-99% similar to the native protein. 

Guidance in determining which amino acid residues can be substituted, inserted, or 

deleted without aboKshingbiological or immunological activity can be found using computer 
programs well known in the art, such as DNASTAR or software from the Genectics Computer 
Group (GCG). Preferably, amino acid changes in protein variants are conservative amino acid 
changes, i.c, substitutions of similarly charged or uncharged amino acids. A conservative 
amino acid change involves substitution of one of a family of amino acids which are related in 
their side chains. Naturally occurring amino acids are generally divided into four families: 
acidic (aspartate, glutamate), basic (lysine, arginine, histidine), non-polar (alanine, valine. 



wo 2004/113571 



19 



PCT/IB2004/002394 



leucine, isoleucine, proUne, phenylalanine, methionine, tryptophan), and uncharged polar 
(glycine, asparagine, glutamine, cystine, serine, threonine, tyrosine) amino adds. 
Phenylalanine, tryptophan, and tyrosine are sometimes classified jointly as aromatic amino 
acids. 

A subset of mutants, called muteins. is a group of polypeptides in which neutral amino 

acids, sudi as serines, aresubstituted for cysteineresidueswWchdonotpaxtidpateindisulfi^^ 
bonds. These mutants may be stable over a broader temperature range than native secreted 
proteins. See Mark et al, U.S. Patent 4,959,3 14. 

Itisreasonableto expect that an isolatedreplacementofaleucine with anisoleudneor 
valine, an aspartate with a glutamate, a threonine with a serine, or a similar replacement of an 

amino add withastructurallyrelated amino acid will nothaveamajor effect onlhebiologic^ 
properties of the resulting secreted protein or polypeptide variant. 

Protein variants include glycosylated forms, aggregative conjugates with other 

molecules, and covalent conjugates with umelated chemical moieties. Also, protein variants 

also includeaUeUc variants, species variants, andmutdns. Truncations ordeletiom 

wMdi do not affed the differential expression ofthegenearedso variants. Covalentvm^^ 

canbepreparedbylinkingfundionaUties to groups which are foundintheaminoaddch^ 
at the N- or C-temiinal residue, as is known in the art. 

It wiU be recognized in the art that some amino acid sequence of the prostate protdns 
of the invention can be varied without significant effect on the structure or fimdion of the 
protein. If such differences in sequence are contemplated, it should be remembered that there 
are critical areas on the protein which detemune adivity. In general, it is possible to replace 
residues that form the tertiary structure, provided that residues performing a similar fimction 
are used. In other instances, the type of residue may be compldely unimportant if the 
alteration occurs at a non-critical region of the protdn. The replacement of amino acids can 
also change theseledivityofbindingto cell surfece receptors. Ostade d ^\.,Nature 361:266- 
268(1993) describes certainmutationsresultinginselectivebindingoflW-alpha to o^^ 

ofthe two known types ofT>n^ receptors. Thus, thepolypeptidesofthepresentinvention^^ 
mclude one or more amino acid substitutions, deletions or additions, dther from natural 
mutations or human manipulation. 
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The invention further includes variations of the prostate proteins disclosed infra which 
show comparable expression patterns or which include antigenic regions. Such mutants 
include deletions, insertions, inversions, repeats, and site substitutions. Guidance concerning 

which amino acid changes are likely to be phenotypically silent canbe found inBowie,J.U.,et 

al., "DecipheringtheMessageinProtein Sequences: Tolerance to Amino Add Substitat^^^^^ 
&;/ewce 247:1306-1310 (1990). 

Of particular interest are substitutions of charged amino acids with another charged 
amino acid and with neutral or negatively charged amino acids. The latter results in proteins 
with reduced positive charge to improve the characteristics of the disclosed protein. The 

prevention ofaggregationis highly desirable. Aggregation ofproteinsnotonlyresultsinaloss 

of activity but can also be problematic when preparingphaimaceutical formulations, because 
they can be immunogenic. (Pinckard et al., Clin. Exp. Immunol 2:33 1-340 (1 967); Robbins et 
^KDiabetes 36:838-845 (1987); Cleland et al., Crit. Rev. Therapeutic Drug Carrier Systems 
10:307-377 (1993)). 

Amino acids in the polypeptides of the present invention that are essential for function 
can be identified by methods known in the art, such as site-directed mutagenesis or alanine- 
scanning mutagenesis (Cunningham and Wells, Science 244: 1081-1085 (1989)). The latter 
procedure introduces single alanine mutations at every residue in the molecule. The resulting 
mutant molecules are then tested for biological activity such as binding to a natural or 
synthetic binding partner. Sites that are critical for ligand-receptor binding can also be 
determined by structural analysis such as crystallization, nuclear magnetic resonance or 
photoafifinity labeling (Smith et al., JMol Biol. 224:899-904 (1992) and de Vos et al. Science 
255:306-312 (1992)). 

As indicated, changes arepreferably of a minor nature, such as conservative amino acid 
substitutions that do not significantly affect the folding or activity of the protein. Of course, 

thenumber of amino acid substitutionsaskiUed artisan wouldmake depends on manyfactors! 
including those described above. Generally speaking, the number of substitutions for any 
given polypeptide will not be more than 50, 40, 30, 25, 20, 15, 10, 5 or 3. 
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Fusion Proteins 

Fusion proteins comprising proteins or polypeptide fiagments of the subject prostate 
tumor antigen can also be constructed. Fusion proteins are useful for generating antibodies 
agamst amino acid sequences and for use in various assay systems. For example, fusion 
proteins can be used to identify proteins which interact with a protein of the invention or 
which interfere with its biological function. Physical methods, such as protein affinity 

chromatography,or Hbraiy-based assays for protem-protein interactions, such as the yeast two- 
hybrid or phage display systems, can also be used for this purpose. Such methods are well 
known in the art and can also be used as drug screens. Fusion proteins comprising a signal 

sequence and/oratransmembranedomainofaproteinaccordingto the invention orafiBgment 
thereof can be used to target other protein domains to cellular locations in which the domains 
are not normally found, such as bound to a cellular membrane or secreted extracellularly. 

A fusion protein comprises two protein segments fused together by means of apeptide 
bond. As noted, these fragments may range in size from about 8 amino adds up to the full 
length of the protein. 

The second protein segment can be a full-length protein or a polypeptide fragment. 
Protems commonly used in fusion protein construction include fi-galactosidase, J3- 
glucuronidase, green fluoresc«it protein (GFP), autofluorescent proteins, including blue 
fluorescent protein (BFP), glutathione-S-transferase (GST), luciferase, horseradish peroxidase 
(HRP), and chloramphenicol acetyltransferase (CAT). Additionally, epitope tags can be used 
in fusion protein constructions, including histidine (His) tags, FLAG tags, influenza 
hanagglutinin (HA) tags, Myc tags, VSV-G tags, and thioredoxin (Trx) tags. Other fusion 
constructions can include maltose binding protein (MBP), S-tag, Lex a DNA binding domain 
(DBD) fusions, GAL4 DNA binding domain fusions, and herpes simplex virus (HSV) BP 16 
protein fusions. 

These fusions can be made, for example, by covalently linking two protein segments or 
by standard procedures in the art of molecular biology. Recombinant DNA methods can be 
used to prepare fusion proteins, for example, by making a DNA construct which comprises a 
coding sequence encoding a possible antigen according to the invention or a fragment thereof 

in proper reading frame withanucleotide encoding the secondprotein segment and expressing 
the DNA construct in a host cell, as is known in the art. Many kits for constructing fusion 
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proteins are available ftom companies that supply research labs with tools for experiments, 
including, for example, Promega Corporation (Madison, WI), Stratagene (La JoUa. CA).' 
Clontech (Mountain View, CA), Santa Cruz Biotechnology (Santa Cruz. CA). MBL 
Intemational Corporation (MIC; Watertown, MA), and Quantum Biotechnologies (Montreal. 
Canada; 1-888-DNA-KITS). 

Proteins, fusion proteins, or polypeptides of the invention can be produced by 
recombinant DNA methods. For production of recombinant proteins, fosion proteins, or 
polypeptides, a sequence encoding the protein can be expressed in prokaryotic or eukaryotic 
host cells using expression systems known in the art. These expression systems include 
bactCTial, yeast, insect, and mammalian cells. 

Theresulting expressed protein can then be purified from the culture medium or fiom 
extracts of the cultured cells using purification procedures known in the art. For example, for 

proteins fullysecreted into the culturemedium,ceU-freemediumcanbe diluted with so<^um 
acetate and contacted wi& a cation exchange resin, followed by hydrophobic interaction 
chromatography. Using this method, the desired protein or polypeptide is typically greater 
tiian 95% pure. Further purification can be undertaken, using, for example, any of the 
techniques listed above. 

It may be necessary to modify a protein produced in yeast or bacteria, for example by 
phosphorylation or glycosylation of the appropriate sites, in order to obtain a functional 
protein. Such covalent attachments can be made using known chemical or enzymatic methods. 

Aproteinorpolypeptideofthe invention can also be expressedinculturedhost cells in 
a fomx which will faciUtate purification. For example, a protem or polypeptide can be 
expressed as a fijsion protein comprising, for example, maltose binding protein, glutathione-S- 
transferase, or thioredoxin. and purified using a commercially available kit. Kits for 

expression andpurification of such fi^onproteins are availablefrom companies such asNew 
England BioLabs. Pharmacia, and Invitrogen. Proteins, fission proteins, or polypeptides can 
also be tagged with an epitope, such as a 'Tlag» epitope (Kodak), and purified using an 
antibody which specifically binds to that epitope. 

nie coding sequenceoftheprotein variants identified through the sequences disclosed 
herein can also be used to construct transgenic animals, such as mice, rats, guinea pigs. cows, 
goats, pigs, or sheep. Female transgenic animals can then produce proteins, polypeptides, or 



wo 2004/113S71 



23 



PCT/ro2004/002394 



fusion proteins of the invention in their milk. Methods for constructing such animals are 
known and widely used in the art. 

Alternatively, synthetic chemical methods, such as solid phase peptide synthesis, can 
be used to synthesize a secreted protein or polypeptide. General means for the production of 
peptides, analogs or derivatives are outlined in Caiemistry and Biochemistry of Amino Acids, 
Peptides, and Proteins ~ A Survey of Recent Developments, B. Weinstefai, ed. (1983). 

Substitution ofD-amino acids for the normal L-stereoisomer can be carried out to increase the 
half-life of the molecule. 

TypicaUy, homologous polynucleotide sequences can be confiimed by hybridization 
under stringent conditions, as is known in the art. For example, using the following wash 
conditions: 2 x SSC (0.3 M NaCl, 0.03 M sodium citrate, pH 7.0), 0.1% SDS, room 
temperature twice, 30 minutes each; then 2 x SSC, 0. 1 % SDS, 50 °C once, 30 minutes; then 2 
X SSC, room temperature twice, 10 minutes each, homologous sequences can be identified 
which contain at most about 25-30% basepair mismatches. More preferably, homologous 
nucleic acid strands contain 15-25% basepair mismatches, even more preferably 5-15o/o 
basepair mismatches. 

The invention also provides polynucleotide probes which can be used to detect 
complementaiynucleotide sequences, for example, in hybridization protocols such as Northern 
or Southern blotting or in situ hybridizations. Polynucleotide probes of the invention comprise 
at least 12, 13, 14, 15, 16, 17, 18, 19, 20, 30, or 40 or more contiguous nucleotides of the 
nucleic acid sequences providedherein. Polynucleotide probes ofthe invention can comprise 
a detectable label, such as a radioisotopic, fluorescent, enzymatic, or chemiluminescent label. 

Isolated genes corresponding to the cDNA sequences disclosed herein are also 
provided. Standardmolecularbiologymethodscanbeusedtoisolatethecorrespondinggenes 
using the cDNA sequences provided herein. These methods include preparation of probes or 

primers fi-om the nucleotide sequence disclosed herein for use inidentifying or ampUfyingthe 
genes fix)m mammaUan, including human, genomic libraries or other sources of human 
genomic DNA. 

Polynucleotide molecules of the invention can also be used as primers to obtain 
additional copies of the polynucleotides, using polynucleotide ampUfication methods. 
Polynucleotide molecules can be propagated in vectors and ceU hues using techniques weU 
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knownintheart. Polynucleotide molecules can be on linear or circular molecules. Theycan 
be on autonomously replicating molecules or on molecules without replication sequences 
They can be regulated by their own or by other regulatory sequences, as is known in the art. 

Polynucleotide Constructs 

Polynucleotide molecules comprising the coding sequences of the gene variants 
identified through the sequences disclosed herein can be used in a polynucleotide construct, 

such asaDNAorRNA construct. Polynucleotidemoleculesofthe invention canbe used, for 
example, in an expression construct to express all or a portion of a protein, variant, fi^ion 

protein, or single-chainantibodyinahost cell. An expression construct comprisesapromoter 
which is functional in a chosen host cell. The skilled artisan can readily select an appropriate 
promoter from the large number of cell type-specific promoters known and used in the art. 
■nie expression construct can also contain a transcription terminator which is functional in the 
host cell. The expression construct comprises a polynucleotide segment which encodes aU or a 
portion of the desired protein. TTie polynucleotide segment is located downstream from the 
promoter. Transcription of the polynucleotide segment initiates at the promoter. The 
expression construct can be linear or dix^ular and can contain sequences, if desired, for 
autonomous replication. 

Also included are polynucleotide molecules comprising the promoter and UTR 
sequences of the subject novel geaes, operably Unked to the associated protein coding 
sequence and/or other sequences encoding a detectable or selectable marker. Such promoter 
and/or UTR-based constructs are useful for studying the transcriptional and translational 
regulation of protein expression, and for identifying activating and/or inhibitory regulatory 
proteins. 



Host Cells 



An expression construct can be introduced into a host cell. The host cell comprising 
the expression construct can be any suitable prokaiyotic or eukaryotic cell. Expression 

systems inbacteria include those described in Chang e^«/..^a/«re275:615(1978);C3oeddel^ 
al. Nature 281: 544 (1979); Goeddel etal. Nucleic Acids Res. 8:4057 (1980); BP 36 776' 
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U.S. 4.551.433; deBoer a/.. Proc. Natl AcadScl USA 80: 21-25 (1983); and Siebenlist et 
al. Cell 20: 269 (1980). 

Expression systems in yeast include those described in Himmen et al, Proc Natl 
Acad. Sci. USA 75: 1929 (1978); Ito etal, J Bacterial 153: 163 (1983); Kv^etal., Mol Cell 
Biol 6: 142 (1986); Kunze et al, J Basic Microbiol 25: 141 (1985); Gleeson et al, J. Gen 
Microbiol 132: 3459 (1986), Roggenkamp a/., Mol Gen. Genet. 202: 302 (1986)); Das et 
JBacteriol. 158: 1165 (1984); DeLouvencourtef a/. V^acW. 154:737(1983). Van den 
Berg etal, Bio/Technology S: 135 (1990), Kvmzo^al., J. Basic Microbiol 25: 141 (1985); 
Cregg et al, Mol Cell Biol 5: 3376 (1985); U.S. 4,837,148; U.S. 4,929.555; Beach and 
Niirse,iVaft.re300: 706 (1981); Davidow etal, Curr. Genet. 10: 380 (1985); Gaillaidine^a/., 
Curr. Genet. 10: 49 (1985); Ballance et al. Biochem. Biophys. Res. Commun. 112: 284-289 
(1983); Tilbum et al. Gene 26: 205-22 (1983); Yelton etal, Proc. Natl Acad, Scl USA 81: 
1470-1474 (1984); Kelly and Hynes. EMBO J. 4: 475479 (1985); EP 244.234; and WO 
91/00357. 

Expression of heterologous genes in insects can be accompUshed as described in U.S. 
4.745.051 ; Friesen et al (1986) 'The Regulation of Baculovirus Gene Expression" in- THE 
MOLECULAR BIOLOGY OF BACULOVIRUSES (W. Doerfler, ed.); EP 127,839; EP 
155,476; Vlsketal, J. Gen. Virol 69: 765-776 (1988); Miller etal. Ann. Rev. Microbiol 42: 
177 (1988); Carbonell etal. Gene 73: 409 (1988); Maeda et al. Nature 315: 592-594 (1985)- 
Lebacq-Verheyden et al. Mol Cell Biol 8: 3129 (1988); Smith etal. Proc. Natl Acad Sci 
USA 82: 8404 (1985); Miyajima et al. Gene 58: 273 (1987); and Martin et al, DNA 7:99 
(1 988). Numerous baculoviral strains and variants and corresponding permissive insect host 
cells from hosts are described in Luckow et al, Bio/Technology (1988) 6: 47-55, Miller et al 
in GENETIC ENGINEERING (Setlow, J.K. et al eds.). Vol. 8. pp. 277-279 (Plenum 
PubUshing, 1986); and Maeda a/.. Nature, 315: 592-594 (1985). 

MammaKan expression can be accompUshed as described in Dijkema et al. EMBO J. 
4: 761(1985); Gormanetal. Proc. Natl Acad Scl USA 79: 6777 (1982b); Boshart er«/.. Cell 
41: 521 (1985); and US 4,399,216. Other features of mammalian expression can be 
fadUtated as described in Ham and Wallace, Meth Enz. 58: 44 (1979); 

Expression constructs can be introduced into host cells using any technique known in 
the art. These techniques include transfenrin-polycation-mediated DNA transfer, transfection 
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h^p.m«»pro^wMchmaybeisola,edbyvirtueofc.<,sa..^^ 
P^^ encoded by a gene described herein or whose encoding n«=.eo.ide se,uenc.s 
m^ggenomrcDNA.mRNAorcDKAmaybeiaolated.hr„u^hybridi.«„„;^^. 

oomplem^.a,y,e,uenceofgenomicorsubgenomicnuc.eo«des«,uen«s„oDNAofag™e 
W^norfia^entsthereof Itwinals.beappreci«edbyoneskiUedin.hear.thatdegenL 
DNA ™ can encode a hnnor protein ac«rding to the inven«on and these L also 
n.cndeatoheuK.udedwid^thepresentinv«..ion.sareaUe,icvarian.softhe.^^^^ 

DNA . TT " " ~ '"'^ "-^-^ '^o-bimm. 

DNA t^logy By „ ..^^^^^ 

~ aprotem c^osition is substanti^aiy tiee of other prote^ wM. ^ uo. the 

He present invention also includes therapeutic or phanuaceutical c«„posi.ions 
compHsmg a protein accor^ling to the invention in an efi^ve amomrt fi>r treating parents 
^t^sc^^andamethodc^nprislng^lministeringa^ 
P^^ll^^ecomposTtronaandmedrodaareusefulfortreath^cancers associated 
s«bjec.pro.ems.^^p,osta..eancer.Onesldnedin,he.rtcan,eadily„seavarietyof^^^^ 

^nr.heartt„det«minewhetherthep.otehrwouldbeuaetU,inpr«noth.Lr;: 
funchonmg in a particular cell type. 

Anti-Prostate Antigen AnHhnHiVc 

antn, d^ ^^'^'^ ^ of onagen 

»hb^d.. and ftagments for use as diagnostics and therapeutics, -nrese antibodies may be 

polyclonal ormonoclonal. Polyclonal antrWiescanbepr^ar^d by ^ 
o.herammalsbyinJeotiugantigenfcllowedbysnbse,pentboostsatappr„pri..^^^^ 



wo 2004/113571 



PCT/IB2004/002394 



27 



animals are bled and sera assayed against purified protein usually by ELISA or by bioassay 
based upon the abUity to block the action of the corresponding gene. When using avian 
species, e.g., chicken, turkey and the like, the antibody can be isolated fi-om the yolk of the 
egg. Monoclonal antibodies can be prepared after the method of Milstein and Kohler by 
5 fusing splenocytes firom immunized mice with continuously repUcating tumor cells such as 
myeloma or lymphoma cells. [Milstein and Kohler, Nature 255:495-497 (1 975); Gulfre and 
^^^MethodsinEnzymology:lmnmnochemicalTecMiques73'A-A6,l^ 
eds.. Academic Press, (1981) which are incorporated by reference]. The hybridoma ceUs so 
formed are then cloned by limiting dilution methods and supemates assayed for antibody 

1 0 production by ELISA, RIA or bioassay. 

The unique ability of antibodies to recognize and specifically bind to target protems 
provides an approach for treating an overexpression of the protein. Thus, another aspect of the 
present invention provides for a method for preventing or treating diseases involving 
overexpression of the protein by treatment of a patient with specific antibodies to the protein. 

1 5 Specific antibodies, either polyclonal or monocloiuil, to the protein can be produced by 

any suitable method known in the art as discussed above. For example, by recombinant 
methods, preferably in eukaryotic cells murine or human monoclonal antibodies can be 
produced by hybridoma technology or, alternatively, the protein, or an immunologically active 
firagment thereof or an anti-idiotypic antibody, or firagment thereof can be administered to an 

20 animal to elicit the production of antibodies capable of recognizing and binding to the protein. 
Such antibodies can be firom any class of antibodies including, but not limited to IgG, IgA, 
IgM, IgD, and IgE or in the case of avian species, IgY and firom any subclass of antibodies. 

The availability of isolated protem allows for the identification of small molecules and 
low molecular weight compounds that inhibit the binding of protein to binding partners, 

25 tiirough routine application of high-throughput screening methods (HTS). HTS methods 
generally refer to technologies that permit the rapid assaying of lead compounds for 
therapeutic potential. HTS techniques employ robotic handling of test materials, detection of 
positive signals, and interpretation of data. Lead compounds may be identified via the 
incorporation of radioactivity or through optical assays that rely on absorbance, fluorescence or 

30 luminescence as read-outs. [Gonzalez, J.E. et al. Curr. Opin. Biotech. 9:624-63 1 (1998)]. 
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Model systems are available that can be adapted for use in high throughput screening 
for compounds that inhibit the interaction of protein with its ligand, for example by competing 
withprotein for ligand binding. Sarvbhi et al.. Anal. Biockem. 237:70-75 (1996) describe cell- 
free, non-isotopic assays for discovering molecules that compete with natural Ugands for 
binding to the active site of IL-l receptor. Martens, C. etcd.. Anal Biochem. 275:20-31 (1999) 

describeagenericparticle-basednonradioactivemethodinwhichalabeledligandbindstoits 
receptor immobUized on a particle; label on flie particle decreases in tiie presence of a 
molecule that competes wifli the labeled Ugand for receptor binding. 
Antibody Preparation 

(i) Starting Materials and Methods 

Immunoglobulins (Ig) and certain variants tiiereofaieknown and many havebeenprepared in 
recombinant cell culture. For example, see U.S. Pat. No. 4.745.055; EP 256,654; EP 120,694; 
EP 125,023; EP 255,694; EP 266,663; WO 30 88/03559; Faulkneret al., Natiire, 298: 286 
(1982); Morrison, J. Immun.. 123: 793 (1979); Koehler et al., Proc. Natl. Acad. Sci. USA, 77: 
2197 (1980); Raso et al.. Cancer Res.. 41: 2073 (1981); Morrison et al.. Ann. Rev. Immunol.. 
2: 239 (1984); Morrison, Science, 229: 1202 (1985); and Morrison et al., Proc. Natl. Acad.' 
Sci. USA, 81: 6851 (1984). Reassorted immunoglobulin chains are also known. See, for 
example. U.S. Pat. No. 4,444,878; WO 88/03565; and EP 68,763 andreferences citedth^ein. 
The immunoglobulin moiety in the chimeras of the present invention may be obtained from 
IgG-1, IgG-2. IgG-3, or IgG-4 subtypes, IgA. IgE, IgD, or IgM, but preferably from IgG-1 or 
IgG-3. 

(ii) Polyclonal Antibodies 

Polyclonal antibodies to the subject prostate antigens are generally raised in animals by 
multiple subcutaneous (sc) or intraperitoneal (ip) injections of the antigen and an adjuvant. It 

maybeusefol to conjugatetiie antigen orafragmentcontainingthetarget amino add se^^^ 
to a protein that is immunogenic in the species to be immunized, e.g., kej^iole limpet 
hemocyanin. serum albumin, bovine thyroglobulin, or soybean trypsin inhibitor using a 
bifunctional or derivatizing agent, for example, maleimidobenzoyl sulfosuccinimide ester 
(conjugation through cysteme residues), N-hydroxysuccinimide (through lysine residues). 
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glutaraldehyde or succinic anhydride. 

or derivative by combMng about 1 «g or mg of th. peptide or conjugate (for rabbits or 
mtee, respectively) , volumes of Freuntfs complete adjuvan. and injecting the solution 
mtrademully at multiple sites. One month later the animals are boosted «dth 1/5 to 1/1 0 the 
or,g,nal amount of peptide or conjugate in Freund^s complete adjuvant by subcutaneous 
mjectionatmultiplesites. Sevento Mdayslat^theammalsarebledandtheserumis assayed 
for antibody titer to the «.tigen or a fiagment thereof. Animals are boosted until the titer 
plateaus, fteferably. the animal is boosted with fl,. conjugate of tte same polypeptide or 

ftagm^^thereotbmcoqugatedtoadiffaemproteinand/orthrough adiire^ntcross-H^^ 
reagent Conjugates also can be made in recombinant cell culture as protein fusions. Also 
aggregating agents such as alum are suitably used to enhance the immune response.' 

(iii) Monoclonal Antibodies 

Mon«=l„nal antibodies are obtamed ftom a population of substantially homogeneous 
antibodies, i.e., fte individual antibodies comprising the population are identical excq,t for 
possible naturally occumng mutations tt>at may be present in minor amounts. Tlus the 
modifier "monoclonal" indicates the character of the antibody as not betag a mixture of 

discrete antibodies. 

For example, monoclonal antibodies using for practicingtiusinventionmaybemade 
usmgfl,ehybridomamethodflrstdescribedbyKohlerandMilstein,Natir«, 256: 495(1975) 
or may be made by recombinant DNA methods (Cabilly et al., supra). 

Inthehybridomamethod,amonseorotherappn,priatehostauimal,sudiasahamster 
.s umnunized as hereinabove descdbed u> elicit lymphocytes a«t produce or are capable of 
producing antibodies that wUl specifically bind to tiie antigen or ftagment thereof used for 

.mmunization.Altematively.l^phocytesmaybeimmunizedi„vit,o. Lymphocytes thenare 

fi.sedw..hmyelomacel]susingasuitablefhsingagent.suchaspolyea^^^ 

hybrrdoma ceU (Goding, Monoclonal Antibodies: Principles and Practice, pp.59.103 

[Academic Press, 1986]). 
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Thehybnd<macellstoprep<«dare8«dedandg«>™ma^tableculturemedimn 
that prtfe«bly contains one or more substan«s fl.at inhibit the growfl. „r survival of the 

unftaed.p.r«^m5«lomaeeUs.Porex«nple, if theparental myeloma eeUslackthe enzyme 

hypoxanthineguamnephosphoribosylhansferaseCHGPRTorHPRTithecutoremediumfcr 
the hybndomas typically will include hypoxanthine, aminopterin, and ftymidine (HAT 
med.um). which substances prevent tte growth of HGPRT-deflci^tt cells 
Prefcred myeloma cells are «K,se ftase efSoi«ay. support stable high-level 

production„fantibodybytheseIectedanffl,„dy.p„.hcingceUs,anda«s»siavetoamedium 

such asHATmedium.Amongfl.ese.preferredmyelomacell lines aremurinemyeloma lines 
such as those derived ftom MOPC21 and MPC-U mouse Umxors available torn the Salk 
h^tin-te CeU Distnbudon Center. San Diego, CaUf USA, and SP-2 cells available ftom the 
American Type Culture Collection, Rockville, Md. USA. 

Culture medium in wWA hybddoma cells are growing is assayed for production of 

n.onoclonalannTx>diesdirect«l.gains.d.eprostan,an«geaP,.ferably,t^^ 

ofmonoclonalamibodiesproducedbyhybndoma cells is de.erminedbyimmunop.ecipi,a.io„ 
or by an m vitn> binding assay, such as radioimmunoassay (RIA) or enzym^linked 

mununoabsorbent assay (ELISA). 

Tte binding afSnity of the monoclonal antibody can. for example, be detenmned by 
the Scatchard analysis of Munson and PoUanl, Anal. Biochcm. 107: 220 (1980) 

After hybridoma cells are idenHfled that produce antibodies of tt,e desired specificity 
affinity, and/or activity, fte clones may be subcloned by limiting dilution procedures, ani 
grown by standard methods (Coding, supra). Suitable culture media for this purpose include 

forexample,D.MEMorRPMI.1640m«iium.Inaddidon,fl.ehybridomacellsmaybegmwJ 
m vivo as ascites tumors in an animal. 

The monocl«,al atfibodies secreted by the subdlones ^ suitably separated from die 
cnl«« medimn. ascites fluid, or serum by conventional immunoglobulin purification 
procedures such as, &r example, protem A-Sepharose. hydroxyapatite chr^natography gel 
electrophoresis, dialysis, or afSnity chromatography. 

DNA encoding the monoclonal antibodies of the invention is re«iay isolated and 
sequenced using conventional procedures (e.g.. by using oligonucleotide pmbes fl.at are 
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capable of binding specifically to genes encoding the heavy and light chains of murine 
antibodies). The hybridoma cells of the invention serve as a preferred source of such DNA. 

Once isolated, the DNAmaybeplaced into expression vectors, which are then transfected into 
host cells such as E. coU cells, simian COS cells, Chinese hamster ovary (CHO) cells, or 
myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis 
of monoclonal antibodies in the recombinant host cells. Review articles on recombinant 
expression in bacteria of DNA encoding the antibody include Sketra et al.. Cuir. Opinion in 
Immunol., 5: 256-262 (1993) and Pluckthun, Immunol. Revs., 130: 151-188 (1992). 

The DNA also may bemodified, for example, by substituting the coding sequence forhuman 
heavy- and Ught-chain constant domains in place of the homologous murine sequences 
(Morrison, et al., Proc. Natl. Acad. Sci. USA, 81 : 6851 [1984]), or by covalentlyjoining to the 

immunoglobulin coding sequence aUorpartofthe coding sequence foranon-immunoglobulin 
polypeptide. In that manner, "chimeric" or "hybrid" antibodies are prepared that have the 
binding specificity of an anti-prostate antigen monoclonal antibody herein. 

Typically such non-immunoglobulin polypeptides are substituted for the constant 
domains of an antibody of the invention, or they are substituted for the variable domains of 
one antigen-combining site of an antibody of the invention to create a chimeric bivalent 
antibody comprising one antigen-combining site having specificity for prostate antigen 

according to the invention and another antigen-comWningsitehavingspecifidlyforadi^^ 
antigen. 

Chimeric or hybrid antibodies also may be prepared in vitro using known methods ui 
synthetic protein chemistry, including those involving crosslinking agents. For example, 
immunotoxins may be constructed using a disulfide-exchange reaction or by forming J 
thioeflier bond. Examples of suitable reagents for this purpose include iminothiolate and 
methyl-4-mercaptobutyrimidate. 

(iv) Humanized Antibodies 

Methods for humanizing non-human antibodies are well known in the art. Generally, a 
humanized antibody has one or more amino acid residues introduced into it fi-om a sourle 
which is non-human. These non-human amino acid residues are often referred to as "import" 
residues, which are typically taken from an "import" variable domain. Humanization can be 
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essentially perfonned following the method of Winter and co-workers (Jones et al.. Nature 
321. 522-525 [1986]; Riechmann et al., Nature 332, 323-327 [1988]; Verhoeyen et al.. Science 
239, 1534-1536 C 1988]), by substituting rodent CDRs or CDR sequences for the corresponding 
sequences of a human antibody Accordingly, such "humanized" antibodies are chimeric 
antibodies (Cabilly et al.. supra), wherein substantially less than an intact human variable 
domain has been substituted by the corresponding sequence from a non-human species In 
practice, humanized antibodies are typically human antibodies in which some CDR residues 
and possibly some FR residues are substituted by residues from analogous sites in rodent 
antibodies. 

The choiceofhumanvariabledomains. both light and heavy.tobeusedinmaking the 
humamzed antibodies is very important to reduce antigenicity. According to the so-called 
"best-fit" method, the sequence of the variable domain of a rodent antibody is screened against 

the entnrelibraryofknown human variable-domain sequences. Thehuman sequence whichis 
closest to that of the rodent is then accepted as the human firework (FR) for the humanized 
antibody (Sims et al.. J. Immunol.. 151: 2296 [1993]; Chothia and Lesk. J. Mol. Biol.. 196: 
901 [1987]). Another method uses a particular framework derived from the consensus 
sequence of all human antibodies of aparticular subgroup of Ught or heavy chains. The same 
framework may be used for several different humanized antibodies (Carter et al.. Proc Natl 
Acad. Sci. USA. 89: 4285 [1992]; Presta et al., J. Immnol., 151: 2623 [1993]). 

It is further important that antibodies be humanized with retention of high affinity for 
the antigen and other favorable biological properties. To achieve this goal, according to a 
preferred method, humanized antibodies are prepared by aprocess of analysis of theparental 
sequences and various conc^tual humanized products using three-dimensional models of the 
parental andhumanizedsequences.TWdimensionalimmunoglobulinmodelsarecomm^ 
available and are famiUar to those skiUed in the art Computer programs are available which 
Illustrate and display probable three-dimensional conformational structures of selected 
.candidate immunoglobulin sequences. Inspection of these displays permits analysis of die 

hkelyrole of theresidues in the functioning ofthe candidate immunoglobulin sequence i e 
the analysis of residues that influence the ability of the candidate immunoglobulin to bind 

antigen. mthisway.FRresiduescanbe selected and combined fromthe consensus andimport 
sequences so that the desired antibody characteristic, such as increased affinity for the target 
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antigen(s), is achieved. In general, the CDR residues are directly and most substantially 
involved in influencing antigen binding. 

(v) Human Antibodies 

Human monoclonal antibodies can be made by the hybridoma method. Human 
myeloma and mouse-human heteromyeloma cell lines for the production of human 
monoclonal antibodies have been described, for example, by Kozbor, J. Immunol. 133, 3001 
(1984); Brodeur, et al., Monoclonal Antibody Production Techniques and Applications, pp.51 - 
63 (Marcel Dekker, Inc., New York, 1987); and Boemer et al., J. Immunol., 147: 86-95 
(1991). 

It is now possible to produce transgenic animals (e.g., mice) that are capable, upon 
immunization, of producing a foil repertoire of human antibodies in the absence of 
endogenous immunoglobulin production. For example, it has been described that the 
homozygous deletion of the antibody heavy-chain joining region (J„) gene in chimeric and 
germ-line mutant mice results in complete inhibition of endogenous antibody production. 
Transfer of the human germ-line immunoglobulin gene array in such germ-line mutant mice 
will result in the production of human antibodies upon antigen challenge. See, e.g., Jakobovits 
et al., Proc. Natl. Acad. Sci. USA, 90: 2551 (1993); Jakobovits et al., Nature, 362: 255-258 
(1993); Bruggennann et al.. Year in Immuno., 7: 33 (1993), 

Altematively, the phage display technology (McCafferly et al.. Nature, 348: 552-553 
[1990]) can be used to produce human antibodies and antibody fragments in vitro, from 
inmiunoglobulin variable (V) domain gene repertokes from non-immunized donors. 

According to this technique. antibodyVdomain genes are cloned in-frame into eitheramajor 
or minor coat protein gene of a fflamentous bacteriophage, such as Ml 3 or fd, and displayed as 
fonctional antibody fragments on the surface of the phage particle. Because the filamentous 
particle contains a single-stranded DNA copy of the phage genome, selections based on tiie 
functional properties of the antibody also result in selection of the gene encoding the antibody 
exhibiting those properties. Thus, the phage mimics some ofthe properties of the B-cell. Phage 
display can be performed in a variety of formats; for their review see, e.g., Johnson and 
Chiswell, Curr. Op. Struct. Biol., 3: 564-571 (1993). Several sources of V-gene segments can 
be used for phage display. Clackson et al., Nature, 352: 624-628 (1991) isolated a diverse aiia,y 



wo 2004/113571 



34 



PCT/IB2004/002394 



Of anti-oxazolone antibodies from a smaU random combinatorial library of V genes derived 
from the spleens of immunized mice. A repertoire of V genes from non-immunized human 
donors can be constructed and antibodies to a diverse aixay of antigens (including self- 
antigens) can be isolated essentially following the techniques described by Marks et al., J. Mol. 
Biol., 222: 581-597 (1991), or Griffith et al., EMBO J., 12: 725-734 (1993). 

In a natural immune response, antibody genes accumulate mutations at a high rate 
(somatic hypermutation). Some of the changes introduced will confer hi^er affinity, and B 
cells displaying higih-afiBnity surface immunoglobulin are preferentially replicated and 
differentiated during subsequent antigen challenge. This natural process can be mimicked by 
employing the technique known as "chain shuffling" (Marks et al., Bio/Technology, 10: 779- 
783 [1992]). In this method, the affinity of "primary" human antibodies obtained by phage 
display can be improved by sequentially replacing the heavy and light chain V region genes 
with repertoires of naturally occurring variants (repertoires) of V domain genes obtained from 
non-immunized donors. This technique allows the production of antibodies and antibody 
fragments with affinities in the nM range. A strategy for making very large phage antibody 
repertoires has been described by Waterhouse et al., Nucl. Acids Res., 21 : 2265-2266 (1993). 

Gene shuffling can also be used to derive human antibodies from rodent antibodies, 
where the human antibody has similar affinities and specificities to the starting rodent 
antibody. According to this method, which is also referred to as "epitope imprinting", the 
heavy or light chain V domain gene of rodent antibodies obtained by phage display technique 
is replaced with a repertoire of human V domain genes, creating rodent-human chimeras. 
Selection on antigen results in isolation of human variable capable of restoring a fimctional 
antigen-binding site, i.e., tiie epitope governs (imprints) the choice of partner. Wh«i the 
process is repeated in order to replace the remaining rodent V domain, a human antibody is 
obtained(seePCTWO93/06213, published Apr. 1. 1993). Unlike traditional humanization of 
rodent antibodies by CDK grafting, this technique provides completely human antibodies, 
which have no fiiamework or CDR residues of rodent origin. 

(vi) Bispecific Antibodies 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that 
have binding specificities for at least two different antigens. In the present case, one of the 
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binding specificities wUl be tx> a prostate antigen according to the invention. Methods for 
making bispedfic antftodies are known in the art. 

TraditionaUy, the recombinant production of bispecific antibodies is based on the co- 
expression of two immunoglobulin heavy chain-light chain pairs, where the two heavy chains 
have different specificities (Milstein and Cuello. Nature. 305: 537-539 [1983]). Becauseof the 
random assortment of immunoglobulin heavy and light chains, these hybtidomas (quadiomas) 
produce a potential mixture of 10 different antibody molecules, of which only one has the 

correctbispecificstnicture. Thepuiificationofthe correct molecule, which is usuallydoneby 
afBnity chromatography steps, is rather cumbersome, and the product yields are low. Similar 

procedures are disclosed in WO 93/08829 published May 13, 1993, and in Traunecker et al., 
EMBO J., 10: 3655-3659 (1991). 

According to a different and more preferred approach, antibody-variable domains with 
the desired binding specificities (antibody-antigencombining sites) are fiased to 
immunoglobulin constant-domain sequences. The fusion preferably is with an 

immunoglobulinheavy-chain constant domain, comprising atleastpartofthe hinge, CH2, and 
CH3 regions. It is preferred to have the first heavy-chain constant region (CHI), containing the 
site necessary for light-chain binding, present in at least one of the fiisions. DNAs encoding 
the immunoglobuUn heavy chain fijsions and, if desired, the immunoglobulin light chain, are 
inserted into separate expression vectors, and are co-transfected into a suitable host organism. 

TWs provides for gteatflexibiUty in adjusting themutualpioportionsofthe three polypepti^ 
fiagments in embodiments when unequal ratios of the three polypeptide chains used in the 
construction provide the optimum yields. It is, however, possible to insert the coding 

sequences for two or aUthreepolypeptide chains in one expression vectorwhentheproduction 
of at least two polypeptide chains in equal ratios results in high yields or when the ratios are of 
no particular significance. In a preferred embodiment of this approach, the bispecific 
antibodies are composed of a hybrid immunoglobulin heavy chain with a first binding 

specificity in oneaim,andahybridimmunoglobulinheavychain-li^tchainpair(providinga 
second binding specificity) in the other arm. It was found that this asymmetric structure 
facilitates the separation of the desired bispedfic compound from unwanted immunoglobulin 
chain combinations, as the presence of an immunoglobulin light chain in only one half of the 
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bispedfic molecule provides for a facile way of separation. 

For further details of generating bispecific antibodies, see, for example, Suresh et al.. 
Methods in Enzymology, 121 : 210 (1986). 

(vii) Heteroconjuqate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. 
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies 
have, for example, been proposed to target immune system cells to unwanted cells (U.S. Pat 
No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 92/00373; and BP 

03089). Heteroconjugate antibodies may bemadeusing any convenient cross-linldngm^o^^ 
Suitable cross-linking agents are weU known in the art, and are disclosed in U.S. Pat. No. 
4,676,980, along with a number of cross-linking techniques. 

Thepolynucleotides and polypeptidesof the present invention may be utilized in gene 
delivery vehicles. The gene delivery vehicle may be of viral or non-viral ori&nisee generally, 
JoUy. Cancer Gene Therapy 1:51-64 (1994); Kimura, Human Gene Therapy 5:845-852 
(1994); Connelly, Human Gene Therapy 1:185-193 (1995); and Kaplitt, Nature Genetics 
6:148-153 (1994)). Gene therapy vehicles for delivery of constructs including a coding 
sequence of a therapeutic according to the invention can be administered either locaUy or 
systemically. These constructs can utilize viral or non-viral vector approaches. Expressionof 
such coding sequences can be induced using endogenous mammaUan or heterologous 
promoters. Expression of the coding sequence can be either constitutive or regulated. 
Preferred vehicles for gene therapy include retroviral and adeno-viral vectors. 

Representative examples of adenoviral vectors include those described by Berkner, 
Biotechniques 5:616-627 (Biotechniques); Rosenfeld etal, Science 252:431^34 (1991); WO 
93/19191;Kons etal.,RN.A.S. 215-219 (1994); Kass-BisIeretal.,P.N.A.S. 90: 11498-n502 
(1993); Guzman etal.,CWfl«o« 88: 2838-2848 (1993); Guzmane/ a/., Cir.Res. 73: 1202- 
1207 (1993); Zabner et al.. Cell 75: 207-216 (1993); Li et al., Hum. Gene Ther. 4: 403-409 
(1993); Cailaud et al., Eur. J. Neurosci. 5: 1287-1291 (1993); Vincent et al., Nat. Genet. 5: 

130-134(1993); Jafifeetal.,iVa?.Ge«er. 1:372-378 (1992); and Levrero etal.. Gene 101: 195- 
202 (1992). Exemplary adenoviral gene therapy vectors employable in this invention also 
include those described in WO 94/12649, WO 93/03769; WO 93/19191 ; WO 94/28938; WO 
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95/1 1984 and WO 95/00655. Administration ofDNA linked to m adenovirus as described in 
Curiel, Hum. Gene Ther. 3: 147-154 (1992) maybe employed. 

Other gene deUvery vehicles and methods may be employed; including polycationic 
condensed DNA linked or unlinked to kill adenovirus alone, for example Curiel, Hum. Gene 
Titer. 3: 147-154(1992); ligand-linkedDNA, for example see Wu,J:Ao/. Chem. 264: 16985- 
16987 (1989); eukaiyotic ceU deUvery vehicles cells, for example see U.S. Serial No. 
08/240,030, filed May 9, 1994, and U.S. Serial No. 08/404,796; deposition of 
photopolymerized hydrogel materials; hand-held gene transfer particle gun. as described in 
U.S. Patent No. 5,149,655; ionizing radiation as described in U.S. PatentNo. 5,206,152 and in 
WO 92/11033; nucleic charge neutralization or fusion with cell membranes. Additional 
approaches are described in PhiUp, Mol. CeU Biol. 14:241 1 -241 8 (1 994), and in Woffendin, 
Proc. Natl. Acad. Sci. Pi:1581-1585 (1994). 

Naked DNA may also be employed. Exemplary naked DNA introduction methods are 
described in WO 90/11092 and U.S. Patent No. 5,580,859. Uptake efficiency may be 
improved using biodegradable latex beads. DNA coated latex beads are efficiently transported 
into cells after endocytosis initiation by the beads. The method may be improved fijrther by 
treatment of the beads to increase hydrophobidty and thereby feciUtate disruption of the 
endosome and release of the DNA into the cytoplasm. Liposomes that can act as gene deUvery 
vehicles are described in U.S. Patent No. 5,422,120, PCX Patent PubUcation Nos. WO 95/13 
796, WO 94/23697, and WO 91/14445, and EP No. 0 524 968. 

Further non-viral delivery suitable for use includes mechanical delivery systems such 
as the approach described in Woffendin et al., Proc. Natl. Acad. Sci. USA 91(24): 11581- 
11585 (1994). Moreover, the coding sequence and the product of expression of such can be 
delivered through deposition of photopolymerized hydrogel materials. Other conventional 
methods for gene deUvery that can be used for deUvery of the coding sequence include, for 
example, use of hand-held gene transfer particle gun, as described in U.S. Patent No. 
5.149,655; use of ionizing radiation for activating transferred gene, as described in U.S. Patent 
No. 5,206,152 and PCT Patent PubUcation No. WO 92/1 1033. 

The subject antibodies or antibody fi-agments may be conjugated directly or indirectly 
to effective moieties, e.g., radionuclides, toxins, chemotherapeutic agents, prodrugs, cytoslatic 
agents, enzymes and the Uke. In a preferred embodiment the antibody or fragment will be 
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attached to a therapeutic or diagnostic radiolabel direcfly or by use of a chelating agent. 
Examples of suitable radiolabels are well known and include '^I, " lo^jy^ ^^^Sm 
'''Cu,^'Ga,^^o,'^V^eand'««Re. ' ' 

Examples of suitable drugs that my be coupled to antibodies include methotrexate, 
5 adriamycine and lymphokines such as interferons, interleukins and the like. Suitable toxins 
which may be coupled include ricin, cholera and diptheria toxin. 

In a preferred embodiment, the subject antibodies will be attached to a therapeutic 
radiolabel and used for radioimmunotherapy. 

Anti-sense Oligonucleotides 
1 0 In certaui circumstances, it may be desirable to modulate or decrease the amount of the 

protein expressed by a prostate cell. Thus, in another aspect of the present invention, anti- 
sense oligonucleotides can be made and a method utilized for diminishing the level of 
expression a prostate antigen according to the invention by a cell comprising administering 
one or more anti-sense oligonucleotides. By anti-soise oligonucleotides reference is made to 
15 oligonucleotides that have a nucleotide sequraice that interacts through base pairing with a 
specific complementary nucleic add sequence involved in the expression of the target such 
that the expression of the gene is reduced. Preferably, the specific nucleic acid sequence 
involved in the expression of the gene is a genomic DNA molecule or mRNA molecule that 
encodes the gene. This genomic DNA molecule can comprise regulatory regions of the gene, 
20 or the coding sequence for fihe mature gene. 

The term complementary to a nucleotide sequence in the context of antisense 
oligonucleotides and methods therefor means sufficiently complementary to such a sequence 
as to allow hybridization to that sequence in a cell. i.e.. under physiological conditions. 
Antisense oligonucleotides preferably comprise a sequence containing from about 8 to about 
25 100 nucleotides and more preferably the antisense oUgonucleotides comprisefiom about 15 to 
about 30 nucleotides. Antisense oligonucleotides can also contain a variety of modifications 
that confer resistance to nucleolytic degradation such as, for example, modified 
intemucleoside linages [Uhlmann and Peyman, Chemical Reviews 90:543-548 (1990); 

Schneider andBanner,re/raAerfron£eff.31:335, (1990) which are incorporated byreferen^^^ 
30 modified nucleic acid bases as disclosed in 5,958,773 and patents disclosed therein, and/or 
sugars and the like. 
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Any modifications or variations of the antisense molecule which are known in the art 
to be broadly applicable to antisense technology are included within the scope of the invention. 
Such modifications include preparation of phosphorus-containing linkages as disclosed in 
U.S. Patents 5.536,821; 5.541,306; 5,550,1 11; 5,563,253; 5.571,799; 5,587.361, 5,625,050 and 
5 5,958,773. 

The antisense compounds of the invention can include modified bases. The antisense 
oligonucleotides of the invention can also be modified by chemically linking flie 
oligonucleotide to one or more moieties or conjugates to enhance the activity, cellular 
distribution, or cellular uptake of the antisense oUgonucleotide. Such moieties or conjugates 
10 include Upids such as cholesterol, choUc acid, thioether, aHphatic chains, phospholipids, 
polyamines. polyetiiylene glycol (PEG), pahnityl moieties, and others as disclosed in, for 
example, U.S. Patents 5,514,758, 5,565,552, 5,567,810, 5,574,142, 5,585,481. 5,587,371, 
5,597,696 and 5,958,773. 

Chimeric antisense oligonucleotides are also within the scope of the invention, and can 
be prepared fi-om tiie present inventive oUgonucleotides using flie methods described in, for 
example. U.S. Patents 5,013,830, 5,149,797. 5,403.711, 5,491,133, 5,565,350, 5,652,355, 
5,700,922 and 5.958,773. 

In the antisense art a certain degree of routine experimentation is required to select 
optimal antisense molecules for particular targets. To be effective, tiie antisense molecule 

20 preferably is targeted to an accessible, or exposed, portion of the target RNA molecule. 
Altiiough in some cases information is available about the structure of target mRNA 
molecules, tiie current approach to inhibition using antisense is via experimentation. mRNA 
levels in tiie cell can be measured routinely in tireated and control cells by reverse transcription 
of tiie mRNA and assaying tiie cDNA levels. The biological effect can be .detennined 

25 routinely by measuring cell growtii or viabiUty as is known in tiie art. 

Measuring flie specificity of antisense activity by assaying and analyzing cDNA levels 
is an art-recognized metiiod of vaUdating antisense results. It has been suggested tirnt RNA 
firom treated and contix)! cells should be reverse-tiranscribed and ttie resulting cDNA 
populations analyzed. [Branch, A. D., T.LB.S. 23:45-50 (1998)]. 

30 The flierapeutic or pharmaceutical compositions of tiie present invention can be 

administered by any suitable route known in tiie art including for example intravenous. 
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subcutaneous. mtran™scul«,tarfennal.mtafl.«»lormtrac^eb^^ A<toinistotio„oanbe 
other rapid as by injectio., or over a period of tune as by slow infusion or adminisfation of 

slow release formulation. 

Additionally, the subject prostate tumor proteim can also be linkedorc^^^^ 

agents that provide desirablephannaceutical or phannacodynamic properties. For e^^^^ 
the protem can be coupled to any substance known in the art to promote penetration or 
transport across the blood-brain bamer such as an antibody to the transferrin receptor and 
admmistered by intravenous injection (see, for example. Friden et al.. Science 259:373-377 
(1993) which is incorporated by reference). Furthermore, the subject protein A or protein B 

canbestably linked toapolymer such as polyethyleneglycol to obtain desirableproperties^ 
solubiKty, stability. half-Hfe and other pharmaceutically advantageous properties. [See for 

example, Davis etal.,5„z,;ne5«^. 4:169-73(1978); Buruham.^m.y.^a^p.PW^ 
218 (1994) which are incorporated by reference]. 

The compositions are usuaUy employed in the form of pharmaceutical preparations 
Such preparations are made in a mamier well known in the pharmaceutical art See. e g 
RemingtonPhaimaceutical Science, ISthEd., Merck Publishing Co. EastemPA, (1990) 'c^e 

prefen:edpreparationutilizesavehicleofphysiologicalsalinesolution.butitisconte^^^^^ 
that other pharmaceutically accq,table carriers such as physiological concentrations of other 
non-toxic salts, five percent aqueous glucose solution, sterile water or the like may also be 
used. It may also be desirable lhat a suitable buffer be present in the composition. Such 
solutions can. if desired, be lyophiUzed and stored in a sterile ampoule ready for reconstitution 
by the addition of sterile water for ready injection. The primary solvent can be aqueous or 
altemativelynon-aqueous. The subjectprostate tumor antigens, fragments orvariantsther^^ 

canalsobeincorporatedintoasoHdorsemi-soUdbiologicaUycompatiblematrixwhichcanbe 
implanted into tissues requiring treatment 

The carrier can also contain other phaimaceutically-acceptable excipients for 
modifyingormaintainingthepH, osmolarity, viscosity, clarity, color, steriKty. stability, rateof 
dissolution, or odor of the formulation. Similarly, the carrier may contain still other 

pharmaceuticaUy-acceptableexdpientsformodifyingormaintainingreleaseorabso^^^^ 
penetration across the blood-brain barrier. Such excipients are those substances usually and 

customarily employed to for^ulatedosagesforparentaladministrationineitherunitdosag^ 
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multi-dose fonn or for direct infosion into the cerebrospinal fluid by continuous or periodic 



infusion 



Dose administration can be repeated depending upon the pharmacokinetic parameters 
of the dosage formulation and the route of administration used. 

It is also contemplated that certain formulations containing the subject antibody or 
nucleic add antagonists are to be administered oraUy. Such formulations are preferably 
encapsulated and formulated vdth suitable carriers in soUd dosage forms. Some examples of 
suitable carriers, excipients, and diluents include lactose, dextrose, sucrose, sorbitol, mannitol, 
starches, gum acada, caldum phosphate, alginates, caldum silicate, microcrystalline cellulose] 
polyvinylpyitoUdone, cellulose, gelatin, syrup, methyl cellulose, methyl- and 
piopylhydroxybenzoates, talc, magnesium, stearate, water, mineral oil, and the like. The 
formulations can additionally include lubricating agents, wetting agents, emulsifying and 
suspending agents, preserving agents, sweetening agents or flavoring agents. The 
compositions may be formulated so as to provide rapid, sustained, or delayed release of the 
active ingredients after administration to the patient by employing procedures weU known in 
the art. The formulations can also contain substances that diminish proteolytic degradation 
and promote absorption such as, for example, surface active agents. 

The specific dose is calculated according to the approximate body wdght or body 
surface area of the patient or the volume of body space to be occupied. Hie dose will also be 

calculated dependent upon theparticularrouteofadministrationselected. Further refinement 
of the calculations necessary to determine the appropriate dosage for treatment is routinely 
made by those of ordinary skill in the art Such calculations can be made without undue 
experimentation by one skilled in the art in light of the activity disclosed herdn in assay 
preparations of target cells. Exact dosages are determined in conjunction with standard dose- 
response studies. It will be understood that flie amount of the composition actually 
administered will be determined by a practitioner, in the Ught of the relevant circumstances 
including the condition or conditions to be treated, flie choice of composition to be 
administered, tiie age, wdght, and response of the individual patient, tiie severity of tiie 
patient's symptoms, and tiie chosen route of administration. 

m one embodiment of tiiis invention, tiie protein may be tiierapeuticaUy administered 
by implanting into patients vectors or cells capable of producing a biologicaUy-active form of 
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the protein or a precuraor of protein, /.e., a molecule that can be readily converted to a 
biological-active form of the protein by the body. In one approach, cells that secrete the 
protein may be encapsulated into semipermeable membranes for implantation into a patient. 
The cells can be cells that normally express the protein or a precursor thereof or liie ceUs can 
be transformed to express the protein or a precursor thereof It is preferred that the cell be of 

human origin and that theproteinbeahuman protein when thepatient is human. However,it 
is anticipated that non-human primate homologues of the protein discussed infia may also be 
^ective. 

Detection of Snbie ct Prostate Pm teins or Nucleic Ap.iH^ 

Inanumberof circumstances itwouldbedesirabletodetemiinethelevelsofprotein or 
corresponding mRNA in a patient. Evidence disclosed infra suggests the subject prostate 

proteins may be expressed at differentlevels during somediseases,e.g., cancers, provides the 
basis for the conclusion that the presence of these proteins serves a normal physiological 
fonctionrelated to ceU growth and survival. Endogenously produced protein according to the 
invention may also play a role in certain disease conditions. 

The tem» "detection" as medherein in thecontext of detecting thepi^enceofpiotem 
in a patient is intended to include the determining of the amount of protem or the abiUty to 
express an amount of protein in a patient, the estimation of prognosis in terms of probable 
outcome of a disease and prospect for recovery, the monitoring of the protein levels over a 
period of time as a measure of status of the condition, and the monitoring of protein levels for 
determining a preferred therapeutic regimen for the patient. e.g. one with prostate cancer. 

To detect the presence of a prostate protein according to the invention in a patient, a 

sampleis obtained from the patient. Thesamplecanbeatissuebiopsy sample orasampleof 
blood, plasma, serum. CSF. urine or the like. It has been found that the subject proteins are 
expressed at high levels in some cancers. Samples for detecting protein can be taken from 
prostate tissues. When assessing peripheral levels of protein, it is preferred that the sample be 
a sample of blood, plasma or serum. When assessing the levels of protein in the central 
nervous system a preferred sample is a sample obtained from cerebrospinal fluid or neural 
tissue. The sample may be obtained by non-invasive methods, such as from tissue collection(s) 
or cultute(s), or using directly available tissue material (urine, saliva, stools, hair, etc.). 
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In some instances, it is desirable to detennine whether the gene is intact in the patient 
or in a tissue or cell line within the patient By an intact gene, it is meant that there are no 
alterations in the gene such as point mutations, deletions, insertions, chromosomal breakage, 
chromosomal rearrangements and the like wherein such alteration might alter production of 
the corresponding protein or alter its biological activity, stabiUty or the like to lead to disease 
processes. Thus, in one embodiment of the present invention a method is provided for 
detecting and characterizing any alterations in the gene. The method comprises providing an 
oUgonucleotide that contains the gene, genomic DNA or a fiagment thereof or a derivative 
thereof. By a derivative of an oUgonucleotide, it is meant that the derived oligonucleotide is 
substantially the same as the sequence from which it is derived in tiiat the derived sequence 
has sufficient sequence complementarily to the sequence from which it is derived to hybridize 

specificallyto the gene. The derivednucleotide sequence is notnecessarilyphysically derived 
from tiie nucleotide sequence, but may be generated in any manner including for example, 
chemical synthesis or DNA repUcation or reverse tianscription or transcription. 

Typically, patient genomic DNA is isolated from a cell sample fi«m the patient and 
digested wilh one or more restiiction endonucleases such as, for example, TaqI and Alul. 
Using the Southern blot protocol, which is weU known in the art, this assay determines 

wheflierapatientoraparticular tissue inapatient has an intact prostate gene according to tixe 
invention or a gene abnormality. 

Hybridization to a gene would involve denaturing the chromosomal DNA to obtain a 
single-stiranded DNA; contacting the single-stranded DNA witii a gene probe associated with 
the gene sequence; and identifying the hybridized DNA-probe to detect chromosomal DNA 
containing at least a portion of a gene. 

The term '"probe" as used hereinrefastoastructure comprised ofapolynucleotide that 
forms a hybrid structure with a target sequence, due to complementarily of probe sequence 
with a sequence in flie target region. Oligomers suitable for use as probes may contain a 
minimum of about 8-12 contiguous nucleotides which are complementary to the targeted 
sequence and preferably a minimum of about 20. 

A gene according to the present invention can be DNA or RNA oligonucleotides and 
can be made by any method known in the art such as, for example, excision, transcription or 
chemical syntiiesis. Probes may be labeled with any detectable label known in the art such as. 
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for example, radioactive or fluorescent labels or enzymaticmarker^ 

be accomplished by any method lcno>vn in the art such as by PGR, random priming, end 
labeling, nick translation or the like. One skilled in the art will also recognize that other 
methods not employing a labeled probe can be used to determine the hybridization. Examples 
of methods that can be used for detecting hybridization include Southern blotting, fluorescence 

in situhybridization, and single-strand confonnationpolymoipWsmwithPCRamp^ 

Hybridization is typically carried out at 25° - 45» C, more preferably at 32» -40° C and 

moreprefeiablyat37°.38-C.Thetimerequiredforhybridizationisfiomabout0.25toabo^^ 

96hours,morepreferablyfiom about one to about 72 houis,andmostpreferablyfromabout4 
to about 24 hours. 

Gene abnormalities can also be detected by using the PGR method and primers that 
flankorUewithinthegene. The PGR method is well known in the art. Briefly, this method is 
performed using two oligonucleotide primers which are capable of hybridizing to the nucleic 
acid sequences flanking a target sequence that lies within a gene and ampUfying the target 
sequence. The terms "oligonucleotide primer" as used herein refers to a short strand of DNA 
or RNA ranging in length ftom about 8 to about 30 bases. The upstream and downstream 
primers are typically fiom about 20 to about 30 base pairs in length and hybridize to the 

flankingregions for repUcationof&enucleotidesequence.Thepolymerizationis catalyzed by 

aDNA-polymeraseinthepresenceofdeoxynucleotidetriphosphatesornucleotideanalogsto 
produce double-stranded DNA molecules. The double strands are then separated by any 
denaturing method including physical, chemical or enzymatic. Gommonly. a method of 
physical denaturation is used involving heating the nucleic acid, typically to temperatures fiom 
about 80°G to 105OG for times ranging from about 1 to about 10 minutes. The process is 
repeated for the desired number of cycles. 

Theprimers are selected tobe substantially complementary to the sttandofDNAbeing 

amplified. Therefore, theprimersneednotreflect the exact sequenceofthetemplate,butmust 
be suffidentiy complementary to selectively hybridize with flie strand being ampUfied. 

After PGR amplification, flieDNA sequence comprismg the gene orafragment thereof 
is then directly sequenced and analyzed by comparison of the sequence with the sequences 

disclosedherem to identifyalteratiomwhichmight change activity or expressionlevelsorthe 
like. 
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In another embodiment, a method for detecting a tumor protein according to the 
mvention is provided based upon an analysis of tissue expressing the gene. Certain tissues 
such as prostate tissues have been found to overexpress the subject gene. The method 
comprises hybridizing a polynucleotide to mRNA from a sample of tissue that normally 
expresses the gene. The sample is obtained from apatient suspected of having an abnormality 
in the gene. 

To detect the presence of mRNA encoding the protein, a sample is obtained fix>m a 
patient. The sample can be from blood or from a tissue biopsy sample. The sample may be 
treated to extract fhe nucleic acids contained therem. The resulting nucleic acid from the 
sample is subjected to gel electrophoresis or other size separation techniques. 

The mRNA of the sample is contacted with a DNA sequence serving as a probe to 
form hybrid duplexes. The use of a labeled probes as discussed above allows detection of the 
resulting duplex. 

When using the cDNA encoding the protem or a derivative of the cDNA as a probe, 
high stringency conditions can be used in order to prevent false positives, that is thi 
hybridization and apparent detection of the gene nucleotide sequence when in feet an intact 

and functioning gaie is not present When using sequences derived from the gene cDNA,less 
stringent conditions could be used, however, this would be a less preferred approach because 

ofthelikelihoodoffalsepositives. The stringency ofhybridization is determinedbyanumber 
effectors duringhybridization and during thewashingprocedure, including temperature, iom^ 
strength, length of time and concentration of formamide. These factors are outlined in, for 
example, Sambrook et al. [Sambrook et al. (1989), supra]. 

In order to increase the sensitivity of the detection in a sample of mRNA encoding the 
protein A or protein B, the technique of reverse transcription/ polymerization chain reaction 

(RT/PCR)can be used to amplify cDNA transcribed from mRNA encodingtfaeprostate tumor 
antigen. The method of RT/PCR is well known in the art. and can be performed as foUows. 
Total cellular RNAis isolatedby, for example, the standard guanidiumisothiocyanate method 
and the total RNA is reverse transcribed. The reverse transcription method involves synthesis 
of DNA on a template of RNA using a reverse transcriptase enzyme and a 3' end primer. 
Typically, the primer contains an oligo(dT) sequence. The cDNA thus produced is then 
amplified using the PGR method and gene A or gene B specific primers. [Belyavsky et al.. 
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NucLAcidRes. 17:2919-2932 (1989); Kmg andBerger, il/efAo* m£nzymo/ogv, 152:316- 
325, Academic Press, NY (1987) which are incorporated by reference]. 

The polymerase chain reaction method is performed as described above using two 
oligonucleotide primers that are substantially complementary to the two flanking regions of the 
5 DNA segment to be amplified. Following amplification, the PGR product is then 
electirophoresed and detected by ethidium bromide staining or by phosphoimaging. 

The present invention further provides for methods to detect the presence of the protan 
in a sample obtained fix)m apatient. Any method known in the art for detecting protems can 
be used. Such methods include, but are not limited to immunodiffusion, 

10 immunoelectrophoresis, immunochemical methods, binder-ligand assays, 
immunohistochemical techniques, agglutination and complement assays. [Basic and Clinical 
Immunology, 217-262, Sites and Terr, eds., Appleton & Lange,Norwalk, CT. (1991), which is 
incorporated by reference]. Preferred are binder-ligand immunoassay methods including 
reacting antibodies with an epitope or epitopes of the prostate tiamor antigen protein and 

15 competitively displacing a labeled prostate antigen according to the invention or derivative 
thereof. 

As used herein, a derivative of the subject prostate tirnior antigen is intended to include 
a polypq)tide in which certain amino adds have been deleted or replaced or changed to 

modified or unusual amino adds wherein the derivative is biologically equivalent to gene and 
20 wherdntiie polypeptide derivative cross-reacts witii antibodies raised against the protein. By 
cross-reaction it is meant tiiat an antibody reacts with an antigen otiier than the one tiiat 
induced its formation. 

Numerous competitive and non-competitive protdn binding immunoassays are weU 
known in the art. Antibodies employed in such assays maybe unlabeled, for example as used 
25 inagglutinationtests,orlabeledforuseinawidevarietyofassaymefliods. Labels that can be 
used include radionucUdes, enzymes, fluorescers, diemiluminescers, aizyme substiates or co- 
factors, enzyme inhibitors, particles, dyes and the like for use in radioinununoassay (RIA), 
enzyme immunoassays, e.g.. enzyme-linked immunosorbent assay (ELISA), fluorescent 
immunoassays and the like. 

30 
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A further aspect of this invention relates to a method for selecting, identifying, 
screening, characterizing or optimizing biologicaUy active compounds, comprising I 
determination of whether a candidate compound binds, preferably selectively, a target 
molecule as disclosed above. Such target molecules include nucleic acid sequences, 
polypeptides and fragments thereof, typically prostate-specific antigens, even more preferably 
extracellularportions thereof. Bindingmay be assessed in vitro or in vivo, typically in vitro, in 
cell based or acceUular systems. TypicaUy, the target molecule is contacted with the 

compound in any appropriate device, and the formation ofacomplex is deterniined.Tlietarget 
molecule and/or the candidate compound maybe immobilized on a support. The compounds 
identified or selected represent drag candidates or leads for treating cancer diseases, 
particularly prostate cancor. 



15 



20 



25 



While the invention has been described supra, including preferred embodiments, the 
following examples are provided to fijrther illustrate the mvention. 

EXAMPLE 

Tissue Sources: 

Appropriate patient samples were procured for evaluation of research protocol. 
Samples were provided with relevant clinical parameters, and patient consent. Histological 

assessment was performed on all samples and diagnosis by pathology confirmed the presence 
and/or absence of mahgnancy within each sample. Clinical data generally included patent 
history, physiopatholpgy, and parameters relating to prostate cancer physiology. Ten normal 
and ten malignant samples were procured along with available cUnical information. In 
addition, ten samples from organs other than normal prostate and prostate cancer were 
procured to determine the tissue specific expression profile of epitopes. RNA derived from 
normal tissue samples was obtained from known commercial sources. 



Generation of the DATAS Library 

Samples were pooled based on their pathological diagnosis (normal vs. tumor). 
30 Samples were pooled based on equivalent amounts of total RNA to produce total pooled RNA 
samples of lOOug. DATAS libraries were constracted as previously disclosed in U.S. Patent 
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No. 6^51,590, the disclosure of which is incorporated by reference in its entirety. Briefly, 
total RNA was isolated ftom the nonnal and tumor pooled samples and mRNA wal 
subsequently purified from the total RNA for each pooled sample. Synthesis of cDNA was 
performed using abiotinylated oligo (dT) primer. Thebiotinylated cDNA was hybridized with 

5 themRNAoftheoppositesampletoformheteroduplexesbetweenthecDNAandthemRNA. 
For example, the biotinylated cDNA of the pooled normal prostate sample was hybridized 
withprostatetumormRNA. Similarly, prostate tumor biotinylated cDNA was hybridized with 
prostate nonnal RNA to generate the secondDATAS Ubrary. Streptavidin coated beads were 
used to purify the complexes by binding the biotin present on the cDNA. The heteroduplexes 
10 weredigestedwithRNAseHtodegradetheRNAthatwascomplementarytothecDNA. All 
mRNA sequences that were different from the cDNA remained intact. These single stranded 
RNA fragments or "loops" were subsequently amplified with degenerate primers and cloned 
into either pGEM-TorpCRnTOPOvector(Companysource)to produce theDATASUbraiy. 

1 5 Clone sequencing and Bioinfonnatics Analysis: 

The DATAS library was used to transfonn E. coli so that individual clones could be 
isolated using standardmolecularbiologytechniques. From these Ubiaries, 10,665 individual 
clones were isolated and sequenced using an automated AppHed Biosystems 3 1 00 sequencer. 

The nucleotide sequences thatwere obtained were submitted to Ihebioinfoimaticspipeline for 
20 analysis. As the DATAS Ubrary is prepared with PGR amplified DNA, many copies of the 

same sequence are present in theclones isolated from the libraries. Thereforeit is important to 
reduce the redundancy of the clones to identify the number of unique, nonr^eating sequences 
that are isolated. From this large set of DATAS fragments. 1699 unique, nonredundant 
sequences were identified and each DATAS fragment was annotated with a candidate gene. 

25 The annotation was perfonned by aUgning the DATAS fragment to the human genome 
sequence by two methods; 1) a publicly available aligmnent and genome viewer tool, Blat 
(Kent et al.. 2002); and 2) a commercially available genomic alignment andviewer tool. 
Prophecy (Doubletwist). Each DATAS fragment sequence was annotated with I 
corresponding gene that overlapped the genomic sequence containing the DATAS fragment. 

JO Genes were annotated with either the RefSeq accession number, or a hypothetical gene 
prediction from different algorithms, for example, Genscan, Twinscan, or Fgenesh++. 
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Identified genes were either matched to the sequence of the DATAS fragment (in case of exon 
to fragment match), or overlapped with the DATAS fragment (in case of intron to fragment 
match), and the fidl length sequence of the gene was identified. These sequences wei« further 
analyzed to detect all potential membrane spanning proteins. Membrane proteins were 
predicted through the use of different algorithms pubUcly available. For example. TMHMM 
(CBS) was used to identify membrane-spanning domains present within the amino acid 
sequence of the candidate gene. DATAS fragments were located within liie sequence in an 
attemptto detennine whether the spliced event affected infracellular or extraceullar domains. 
Genes associated with the sequence were ranked in order to maximize the identification of 
successful tiierapeutic targets. The highest priority genes had characteristics where the gene 
was a known membrane protein, the fimction of tiie gene was known, and the DATAS 
fragment mapped to an infron on the exti^ceUular domain of tiie protein, indicating fliat the 
DATAS fragment would be presented outside the cell, and available for therapeutic 
intervention by monoclonal antibodies. 

Based on the bioinfoimatic analysis, clones were prioritized in three groups: 

A) Known transmembrane genes with DATAS fi:agments located in introns on the 
extracellular domain. 

B) Known and predicted transmembrane genes with DATAS Augments located in 
exons in either the extracellular or mti:acellular domain. 

C) DATAS firagments that did not match tiie genome 

Expression Monitoring: 

A valid epitope target for prostate cancer requires tiiat tiie expression of tiie epitope be 
limited to prostate tissue, or preferably to prostate tumors. Assessment of tiie expression 
profile for each prioritized sequence was performed by RT-PCR, a procedure well known in 
tiieart. A protocol known as touchdown PGR was used, described in flie user's manual for the 
GeneAmp PGR system 9700. AppUedBiosystems.Briefly,PCRprimersweredesigned to tiie 
DATAS firagmentandusedforendpointRT-PGRanalysis. Each RT reaction contained 5 ng 
of total RNA and was performed in a 100 ^il volume using Archive RT Kit (AppHed 
Biosystems). The RT reactions were diluted 1:50 witii water and 4 ^ti of tiie dUuted stock was 
used in a 50 Hi PGR reaction consisting of one cycle at 94°G for 3 min, 5 cycles at 94"G for 30 
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seconds, 60"C for 30 seconds and 72»C for 45 seconds, with each cycle reducing the annealing 
temperature by 0.5 degree. This was followed by 30 cycles at 94'>C for 30 seconds, 55*^0 for 
30 seconds, and 72 °C for 45 seconds. 1 5 jil was removed from each reaction for analysis and 
the reactions were allowed to proceed for an additional lOcycles. This produced reactions for 
analysis at 30 and 40 cycles, and aUowed the detection of differences in expression where the 
40 cycle reactions had saturated. The level of expression proffle of the DATAS fragment was 
determined in normal and tumor prostate total RNA, as well as total RNA from normal 
samples of brain, heart, liver, lung, kidney, colon, bone marrow, muscle, spleen, and testis. 

Expressionprofiles were prioritized accordinglyforspecificexpressioninprostate tumor and 
low expression found in normal tissues, including normal prostate. 

Verification of RNA Structure: 

DATAS identifies sequences that are altered between the experimental samples. 

However, tiie exact sequenceofflie junctions or borders that theDATAS fragment represents 
can not be determined direcfly from the isolated DATAS fi^gment sequence. The DATAS 
fragment was used, however, to design experiments that elucidate the sequence of each 
transcript present in each sample. Primers were designed to ampUfy a region of the gene larger 
than tiie proposed DATAS fragment sequence. These amplicons were subsequently cloned 
and sequenced for the identification of the exact junctions of all exons and introns. This 
required partial cloning of the isofoims from an identified sample to verify the primary 
stinichare (sequence) of tiie isoforms. All twenty samples (10 normal and 10 tumor samples) 
initially used to generate tiie DATAS libraries were used for tiie verification of tiie mRNA 
structure of the prioritized genes. 

Isolation of full-lengtb clones of isoforms: 

Isolation of tiie fiilHengtii clones containing botii isoforms was accompUshedutiUzmg 
tiie information and DNA fragments generated during flie siructure vaUdation process. Several 
metiiods are appUcable to isolation of flie Ml lengtti clone. Where Ml sequence information 
regarding tiie coding sequence is available, gene specific primers were designed from flie 
sequence and used to amplify flie coding sequence directfy from flie total RNA of tiie tissue 
samples. An RT-PCR reaction was set up using tiiese gene specific primers. The RT reaction 
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was performed as described infra, using oligo dT to prime for cDNA. Second strand was 

produced by standard methods to produce double stranded cDNA. PGR amplification of the 

gene was accompUshed using gene specific primers. PGR consisted of 30 cycles at 94*'C for 

30 seconds, 55°C for 30 seconds, and 72 °C for 45 seconds. The reaction products were 

analyzed on 1% agarose gels and the ampUcons were ligated into prepared vectors with A 

overhangs for ampUcon cloning. 1 jd oftheUgation mixture was used to transform E. coU for 

cloning and isolationoftheampUcon.Ctacepurified,theplasmid containing the ampUcon was 
sequenced on an ABI 3 100 automated sequencer. 

Where limited sequence information was available, the oligo pulling method was utiUzed. 
Briefly, a gene-specific oligonucleotide was designed based on the DATAS fragment. The 
oligonucleotide was labeled with biotin and used to hybridize with a single stranded plasmid 
DNA library prepared from either normal prostate tissue or prostate tumortissue following the 
procedures of Sambrook et al (1989). The hybridized cDNA was separated by streptavidin 
conjugated beads and eluted by heating. The eluted cDNA was converted to double strand 

plasmidDNA and used to transform E.coU cells and the longest cDNA clone was subjected to 
DNA sequencing. 

RESULTS 

Using methods described above, 1699 DNA fi:agments have been identified tiiat 

putatively correspond to exons(novelsphcevariants)expressed exclusively or at an increased 
level in prostate tumor tissue when compared to matched normal prostate tissue. 

These sequences were used to search pubUc databases containing human genomic 
sequences to identify related genes. This search identified 122 Augments tiiat correspond to 
exons of either known or potaitial cell surface proteins. 

Additionally, thirty seven distinct alternatively spliced isoforms were identified from 
die initial sequence tags that appear to contain novel sequence information of cell surfece 
proteins. 

These DNA sequences are disclosed in the Sequence Listing as well as in Table 1, and 
correspond to the nucleic acid sequences having SEQ ID NOS: 1-173, 175, 177, 179,andl81. 
OUgonucleotide primers were designed to each DATAS fragment to determine the specific 
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expression ofthemRNA in apanelofnonnal human tissue. An example is shown in figure 1, 
where the clone corresponding to Sequence ID: No. 92 displays specific expression in prostate 
with very low levels detected in kidney (lane 4) and pancreas (lane 9). All clones that were 
found to be either specifically expressed in prostate or highly expressed in prostate compared 
5 to other tissues were analyzed for expression in tumor samples. 

Figure 2 illustrates the expression profile of one DATAS clone in normal and tumor 
prostate tissue. Expression of this clone is upregulated in two of the three tumor pooled 
samples and is highly expressed in three of the four individual tumor samples. The high 
expression of this spUce event in tumor samples as compared to normal prostate, and the low 

10 expression in other normal human tissues is an example of one candidate that has utility for 
development as a novel epitope for prostate cancer. 

The spHce events for DATAS clones that displayed a specific expression profile for 
prostate and a high differential expression profile for prostate tumors were isolated and the 
sequences for each event was determined. An example is shown in figure 3, where the 

15 sequence of the isolated event was mapped to the genome in Blat, and genomic viewer 
developed bythebioinformaticsdepartmentatUCSC (Kent et al., 2002). Five distinct clones 
were isolated that mapped to the gene locus for STEAP2. One expressed sequence tag (EST), 
AK092666, contained many similar domains as the spUce events that were isolated using 
DATAS. The sequences andpredicted protem translations for all five clones are described in 

20 SEQ NOS. 173 - 182 and are graphically illustrated in figure 3. The length of the open 
reading fimne and the predicted protein size for each isoform is described in Table 2. The 
EST, AK092666 contains a large deletion in exon 5, the terminal 3' exon of STEAP2, with 
two novel exons in the 5 ' region of the transcript The nomenclature for the DATAS derived 
events was based on AK092666 because of higher similarity when compared to the RefSeq 

25 sequence for STEAP2. The first isofoim identified, AK092666_01 (SEQ ID NO 173), 
contains a novel C-teraiinal exon when compared to AK092666, and therefore generates a 
novel junction, and a novel sequence for translation and generates a unique amino acid 
sequence (SEQ ID NO 183). The same novel sequence was generated by isoform 
AK092666_03 (SEQ ID NO 17A which contains the same novel exon with an additional 

30 splicing event of an in fi:ame truncation of exon 4, and by isoforai AK092666_05 (SEQ ID NO 
181), which contains a single codon deletion firom AK092666_01. AK092666_02 (SEQ ID 



wo 2004/113571 



53 



PCT/IB2004/002394 



NO 175) skipped exon 6 of AK092666 and generated the novel amino acid sequence in SEQ 

IDN0184.AK092«i6_04(SEQIDN0179)containsa.hortoutofftametnn>cationofexon 
4. wUch results in the creation of 8 novel aninao acids before encountering a premature stop 
codon (SEQ ID NO 185). 



1™ ""^^ '^^^^^ predicted protein size for 



each novel 



AK092666 



AX092666 01 



Clone Name 



AK092666 02 



AK092666 03 



AK092666 04 
AK092666 05 



ORF length (bp) 



1473 



1365 



1389 



52.7 



1260 



900 



705 



1386 



Protein size (KD) 



56 



51.7 



47.8 



34.1 



26.7 



52.5 



The novel amino acids found in SEQ ID NOS 183 and 184 represent novel epitopes 
that are specifically expressed in prostate cancer in a membrane protein. Hiese epitopes are 
targets for monoclonal antibody immunotherapy for the treatment of prostate cancer. To 
illustrate the different isofomis present, an antibody was generated from the invariant sequence 
present in the 5 ' region (or the amino terminal portion of the piotein) that recognizes all the 
different isofomis. 

An antibody was generated against an amino acid sequence that was common to all 
five isofoims. as weU as present in STEAP2 and AK092666. Prostate cancer cell lines were 
analyzed by western blot to determine what different isoforms would be expressed at the 
protein level. Figure 4 illustrates two bands that were specifically detected by the antibody 
Band A potentially represents the glycosylated, wild type STEAP2 and band B indicates 
isofonns AK092666, AK092666_01, or AK092666_05, which is unresolvable in the gel 

analysis. In addition, multiplebandsofthepropersizeweredetectedsuggestingthatisofomi^ 
of the STEAP2 locus are expressed and rq,resent targets for the immunotherapy in prostate 

cancer. 
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TableL. Sequence infortnation of the DATAS fragments and the alternatively spliced 
isofonns. 



Sequence ID: No. 1 
5 Accession #: NM_005656 

Genomic sequence: chr21 : 39407238-39450894 

Sequence definition: transmembrane protease serine 2 

10 Sequence ID: No. 2 

Accession #: NM_001423 

Genomic sequence: chrl2 : 13265134-13265266 

Sequence definition: Homo sapiens epithelial membrane protein 1 EMPl 

15 Sequence ID: No. 3 

Accession #: NM_000484 

Genomic sequence: chr21 : 23832850-24123073 
Sequence definition: beta amyloid A4 

20 Sequence ID: No. 4 

Accession #: NM_002841 

Genomic sequence: chr3 : 62548596-63240788_l 

Sequence definition: protein tyrosine phosphatase G-type 

25 Sequence ID: No. 5 

Accession #: NM_022124 

Genomic sequence: chrlO : 7 4968313-75112962 

Sequence definition: cadherin related 23 isoform 1 precursor 

30 Sequence ID: No. 6 

Accession #: NM_033056 

Genomic sequence: chrlO: 55940286-56920530 02 
Sequence definition: protocadherin 15 precursor 

35 Sequence ID: No. 7 

Accession #: NM_002847 

Genomic sequence: chr7 : 15858 6667-159621018 01 

Sequence definition: protein tyrosine phosphatase receptor type N 

40 Sequence ID: No. 8 

Accession #: NM__002222 

Genomic sequence: chr3 : 5000696-5354641_l 

Sequence definition: ITPR inositol 145-triphosphate receptor type 1 

45 Sequence ID: No. 9 

Accession #: AC078864.20 

Genomic sequence: chrl2 : 52201280-52201714 
Sequence definition: Genscan prediction 

50 Sequence ID: No. 10 

Accession #: NM_014554; NM_001844; NT 009785.3 
Genomic sequence: chrl2 : 45785273-45856561 

Sequence definition: chrl2_498 potential fusion of SENPl and Collagen 2A; 
also overlaps GS perdiction ^ 



55 



Sequence ID: No. 11 

Accession #: AB064 665 

Genomic sequence: chrM: 9411-9524 

Sequence definition: Homo sapiens mRNA for 0K/SW-CL.16 
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Sequence ID: No. 12 
Accession #: NM_024 029 

Genomic sequence: chrl9 : 10880041-10883719 
Sequence definition: hypothetical protein MGC3262 

Sequence ID: No. 13 
Accession #: NT_008748.79 

Genomic sequence: chrlO : 80881918-80882092 
Sequence definition: Genscan prediction 

Sequence ID: No. 14 
Accession #: AB002360 

Genomic sequence: chrl3 : 112761227-112761344 
Sequence definition: KIAA0362 

Sequence ID: No. 15 
Accession #: AK057572 

Genomic sequence: chrl6 : 14547315-14547422 
Sequence definition: FLJ33010 

Sequence ID: No. 16 

Accession #: NT_034 410 . 5 6/NM_033102 1 

KSuSnce^dof^^^ ^^^1^203503646-203554883/chrl:192169879.192474008 
prSrL2S^54i4""^ " El]c4/LOC85414 - Homo sapiens prostein 

Sequence ID: No. 17 
Accession #: NT_019696.29 

Genomic sequence: chrx : 64173951-64275396 
Sequence definition: Genscan prediction 

Sequence ID: No. 18 

Accession #: NT_007834.17 

Genomic sequence: chr7 : 71656530-71727938 

Sequence definition: Genscan prediction 

Sequence ID: No. 19 
Accession #: NT__0054 03 . 1000 

Genomic sequence: chr2 : 208067141-208067324 
Sequence definition: Genscan prediction 

Sequence ID: No. 20 
Accession #: NT_009654.19 

Genomic sequence: chrl2 : 116716120-116840364 
Sequence definition: Genscan prediction 

Sequence ID: No. 21 
Accession #: AC126564,7 

Genomic sequence: chrl2 : 1314 40407-131440735 
Sequence definition: genomic match 

Sequence ID: No. 22 
Accession #: NT_006171.64 

Genomic sequence: chr4 : 172269202-172299375 
Sequence definition: Genscan prediction 

Sequence ID: No. 23 
Accession #: NM_02514 9.1 

Genomic sequence: chrl7 : 61361324-61409903 
Sequence definition: FLJ20920 
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Sequence ID: No. 24 

Accession #: NT_026437 . 145 

Genomic sequence: chrl4 : 72272372-724 62407 

Sequence definition: Genscan prediction 

Sequence ID: No. 25 
Accession #: NT_030059,13 

Genomic sequence: chrlO : 103933731-103955924 
Sequence definition: Genscan prediction 

Sequence ID: No. 26 
Accession #: AK058112 

Genomic sequence: chrl9 : 1815692-1822319 
Sequence definition: FLJ25383 



Sequence ID: No. 27 
Accession #: NM_002205.1 

Genomic sequence: chrl2 : 55541534-55565494 
on ^f^;^®^^®/^fi^ition: Homo sapiens integrin alpha 5 fibronectin receptor 
alpha polypeptide 

Sequence ID: No. 28 
Accession #: NM_004716.1 

Genomic sequence: chrll : 117114115-117114448 

tyS^7''lcSK7^JSNi''''' sapiens proprotein convertase subtilisin/kexin 

Sequence ID: No. 29 
Accession #: NM_030774 
30 Genomic sequence: chrll : 5003431-5021099 

Sequence definition: prostate specific G-protein coupled receptor [Homo 
sapxens] r- t 

Sequence ID: No. 30 
35 Accession #: AB007932 

Genomic sequence: chrl : 204846394-204846755 

Sequence definition: Homo sapiens plexin A2 PLXNA2 mRNA 

Sequence ID: No. 31 
40 Accession #: AB023177 

Genomic sequence: chr7 : 11157196-11157402 

Sequence definition: Homo sapiens mRNA for KI7^0960 protein 

Sequence ID: No. 32 
45 Accession #: NT_004858.23 

Genomic sequence: chrl : 147688399-147725025 
Sequence definition: Genscan prediction 

Sequence ID: No. 33 
50 Accession #: NT_004873.61 

Genomic sequence: chrl: 14678698-14732191 
Sequence definition: Genscan prediction 

Sequence ID: No. 34 
55 Accession #: NT_029860.99 

Genomic sequence: chrl : 110751286-110854188 
Sequence definition: Genscan prediction 

Sequence ID: No. 35 
60 Accession #: NM_032385.1 

Genomic sequence: chrS : 170086201-170251515 

Sequence definition: Homo sapiens chromosome 5 open reading frame 4 
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C5orf4 

Sequence ID: No. 36 
Accession #: NM_014752.1 

Genomic sequence: chrll : 73182947-73211393 
Sequence definition: KIAA0102 

Sequence ID: No. 37 
Accession #: NP_000295 

Genomic sequence: chrl7 : 15500091-15500332 

Sequence definition: Homo sapiens peripheral myelin protein 22 

Sequence ID: No. 38 
Accession #: NM_020433 

Genomic sequence: chr20 : 42528457-42528759 
Sequence definition: Homo sapiens junctophilin 2 

Sequence ID: No. 39 

Accession #: NT_999999.2 

Genomic sequence: chrM: 9411-9524 

Sequence definition: Genscan Gene Predictions 

Sequence ID: No. 40 
Accession #: NT_004754.1 

Genomic sequence: chrl : 117988850-117989247 
Sequence definition: Genscan Gene Predictions 

Sequence ID: No. 41 

Accession #: NT_011568 . 108 

Genomic sequence: chrX: 47583156-47583796 

Sequence definition: Acembly Gene Predictions /Genscan Gene Predictions 

Sequence ID: No. 42 
Accession #: NP_061116 

Genomic sequence: chr7 : 140900079-140900876 

Sequence definition: transient receptor potential cation channel 

Sequence ID: No. 43 
Accession #: NT_011295.163 

Genomic sequence: chrl9 : 19799239-19804450 
Sequence definition: Genscan prediction 

Sequence ID: No. 44 
Accession #: NP_056051 

Genomic sequence: chr4 : 62284401-62284770 
Sequence definition: lectomedin-3 

Sequence ID: No. 45 
Accession #: NT_033927.57 

Genomic sequence: chrll : 75518014-75562375 
Sequence definition: Genscan prediction 

Sequence ID: No. 46 
Accession #: NM_030774 

Genomic sequence: chrll : 5003431-5021099 

saSeSr ^^^^^^^^^^^ prostate specific G-protein coupled receptor Homo 

Sequence ID: No. 47 
Accession #: NM_022119 

Genomic sequence: chrl6 : 2939532-2939842 
Sequence definition: protease serine 22 
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Sequence ID: No. 48 
Accession #: NP_000155 

Genomic sequence: chrl9: 4 6824 678-4 6824801 

sequence definition: Homo sapiens gastric inhibitory polypeptide receptor 

Sequence ID: No. 49 
Accession #: NM_001627 

Genomic sequence: chr3 : 104784804-104787209 

Sequence definition: activated leukocyte cell adhesion molecule 

Sequence ID: No. 50 
Accession #: NP__056343 

Genomic sequence: chrl7 : 5263335-5263632 

Sequence definition: Homo sapiens DKFZP566H073 protein 

Sequence ID: No. 51 
Accession #: NT__033275.9 

Genomic sequence: chrl5 : 19767754-19767842 

Sequence definition: Acembly Gene Predictions/Genscan Gene Predictions 

Sequence ID: No. 52 

Accession #: NT_004511 . 105 

Genomic sequence: chrl: 37 657082-37657508 

Sequence definition: Genscan Gene Predictions 

Sequence ID: No. 53 

Accession #: NT_007819.76 

Genomic sequence: chr7 : 2293638-2293859 

Sequence definition: Acembly Gene Predictions/Genscan Gene Predictions 

Sequence ID: No. 54 
Accession #: NT_00804 6 , 179 

Genomic sequence: chr8 : 101509107-101509191 

Sequence definition: Acembly Gene Predictions/Genscan Gene Predictions 

Sequence ID: No. 55 
Accession #: NP_001668 

Genomic sequence: chrl : 166712801-166712951 

Sequence definition: ATPase Na+/K+ transporting beta 1 polypeptide 

Sequence ID: No. 56 
Accession #: NP_061332 

Genomic sequence: chr7 : 105724807-105753208 

Sequence definition: B-cell receptor-associated protein BAP29 

Sequence ID: No. 57 
Accession #: NT_008251.42 

Genomic sequence: chr8 : 36531104-36531405 

Sequence definition: Acembly Gene Predictions/Genscan Gene Predictions 

Sequence ID: No. 58 

Accession #: NT_008984 . 116 

Genomic sequence: chrll : 97792879-97792961 

Sequence definition: Genscan Gene Predictions 

Sequence ID: No. 59 
Accession #: NT_011176.84 

Genomic sequence: chrl9 : 11151260-11154382 

Sequence definition: Acembly Gene Predictions/Genscan Gene Predictions 
Sequence ID: No. 60 
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Accession #: ENST00000255124 

Genomic sequence: chr20 : 46047371-46047445 

Sequence definition: Acembly Gene Predictions /Genscan Gene Predictions 

Sequence ID: No. 61 

Accession #: ENST00000262657 

Genomic sequence: chr20 : 29935469-29937596 

Sequence definition: Acembly Gene Predictions /Genscan Gene Predictions 

Sequence ID: No» 62 
Accession #: NT_033903.44 

Genomic sequence: chrll : 58671001-58671164 
Sequence definition: Genscan Gene Predictions 

Sequence ID: No. 63 
Accession #: NP_000360 

Genomic sequence: chrl4 : 78989775-78989913 

sequence definition: Homo sapiens thyroid stimulating hormone receptor 

Sequence ID: No. 64 
Accession #: NP_005219 

Genomic sequence: chr7 : 54724858-54725037 

e???hroblJ^flo^?i^^^ Homo sapiens epidermal growth factor receptor 
erythroblastic leukemia viral v-erb-b oncogene homolog avian 

Sequence ID: No. 65 
Accession #: NP_14 9093 

Genomic sequence: chrl : 203548697-203549088 
Sequence definition: Homo sapiens prostein protein 

Sequence ID: No. 66 
Accession #: NM_030774 

Genomic sequence: chrll : 5003431-5021099 

saSr ^^^^''^^^^"^ prostate specific G-protein coupled receptor Homo 

Sequence ID: No. 67 
Accession #: NM_030774 

Genomic sequence: chrll : 5004 995-5010301 

llpllnT prostate specific G-protein coupled receptor Homo 

Sequence ID: No. 68 
Accession #: NM_030774 

Genomic sequence: chrll : 5004 983-5010305 

sa^ieSr "^^^^^^^^^^^ prostate specific G-protein coupled receptor Homo 

Sequence ID: No. 69 
Accession #: NM__030774 

Genomic sequence: chrll : 5004983-5010305 

saSr '^^^^^^^^^'^^ prostate specific G-protein coupled receptor Homo 

Sequence ID: No. 70 
Accession #: NM_030774 

Genomic sequence: chrll : 4 667240-4678100 

saSenf "^^^^^^^^^^^ prostate specific G-protein coupled receptor Homo 

Sequence ID: No. 71 
Accession #: NM 030774 
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Genomic sequence: chrll : 4677792-4677987 

TeS-lnT prostate specific G-protein coupled receptor Homo 

Sequence ID: No. 72 
Accession #: NM_030774 

Genomic sequence: chrll : 5003430-5007773 

sap^enr prostate specific G-protein coupled receptor Homo 

Sequence ID: No. 73 
Accession #: AK07554 6 

Genomic sequence: chrll : 36643617-36930167 
Sequence definition: predicted protein 

Sequence ID: No. 74 
Accession #: AK07554 6 

Genomic sequence: chrll : 36643626-36931023 
Sequence definition: predicted protein 

Sequence ID: No. 75 
Accession #: AK07554 6 

Genomic sequence: chrll : 36643617-36929351 
Sequence definition: predicted protein 

Sequence ID: No. 7 6 
Accession #: AK07554 6 

Genomic sequence: chrll : 36643617-36929351 
Sequence definition: predicted protein 

Sequence ID: No. 77 
Accession #: AK07554 6 

Genomic sequence: chrll : 36643617-36929351 
Sequence definition: predicted protein 

Sequence ID: No. 78 
Accession #: NT_033927.57 

Genomic sequence: chrll : 75518014-75562375 
Sequence definition: Genscan prediction 

Sequence ID: No. 79 
Accession #: NM__000300 

Genomic sequence: chrl : 19337078-19342056 

Sequence definition: phospholipase A2 group IIA platelets synovial 

Sequence ID: No. 80 
Accession #: NM_000300 

Genomic sequence: chrl : 19337078-19342056 

Sequence definition: phospholipase A2 group IIA platelets synovial 

Sequence ID: No. 81 
Accession #: NM_000300 

Genomic sequence: chrl : 19337078-19342056 

Sequence definition: phospholipase A2 group IIA platelets synovial 

Sequence ID: No. 82 
Accession #: NM_000300 

Genomic sequence: chrl : 19337078-19342056 

Sequence definition: phospholipase A2 group IIA platelets synovial 

Sequence ID: No. 83 
Accession #: NM 000300 
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Genomic sequence: chrl : 19337078-19342056 

sequence definition: phospholipase A2 group IIA platelets synovial 

Sequence ID: No. 84 
Accession #: NM_032323 

Genomic sequence: chrl : 152017962-152027457 

Sequence definition: hypothetical protein MGC13102 - refseq 

Sequence ID: No. 85 
Accession #: NM_032323 

Genomic sequence: chrl : 152017962-152027457 
Sequence definition: hypothetical protein MGC13102 

Sequence ID: No. 86 
Accession #: NM_032323 

Genomic sequence: chrl : 152017962-152027457 
Sequence definition: hypothetical protein MGC13102 

Sequence ID: No. 87 
Accession #: NM_032323 

Genomic sequence: chrl : 152017962-152027457 
Sequence definition: hypothetical protein MGC13102 

Sequence ID: No. 88 
Accession #: NM_032323 

Genomic sequence: chrl : 152017962-152027457 
Sequence definition: hypothetical protein MGC13102 

Sequence ID: No. 89 
Accession #: NM_032323 

Genomic sequence: chrl : 152017962-152027457 
Sequence definition: hypothetical protein MGC13102 

Sequence ID: No. 90 
Accession #: NM_032323 

Genomic sequence: chrl : 152017962-152027457 
Sequence definition: hypothetical protein MGC13102 

Sequence ID: No. 91 
Accession #: AK092666 

Genomic sequence: chr7 : 88376306-88402240 
Sequence definition: STEAP2/AK092666 

Sequence ID: No. 92 
Accession #: AK092666 

Genomic sequence: chr7 : 8837 6306-884 02240 
Sequence definition: STEAP2/AK092666 

Sequence ID: No. 93 
Accession #: AK092666 

Genomic sequence: chr7 : 88376306-88402240 
Sequence definition: STEAP2/AK092666 

Sequence ID: No. 94 
Accession #: NM_005656 

Genomic sequence: chr21 : 3949344 6-39537043 
Sequence definition: TMPRSS2 

Sequence ID: No. 95 
Accession #: NM_005656 

Genomic sequence: chr21 : 39493446-39537043 
Sequence definition: TMPRSS2 
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Sequence ID: No. 96 
Accession #: NM_005656 

Genomic sequence: chr21 : 39493446-39537043 
5 Sequence definition: TMPRSS2 

Sequence ID: No. 97 
Accession #: NM_00565 6 

Genomic sequence: chr21 : 39493446-39537043 
lU Sequence definition: TMPRSS2 

Sequence ID: No. 98 
Accession #: NM_005656 

Genomic sequence: chr21 : 3949344 6-39537043 
ID Sequence definition: TMPRSS2 

Sequence ID: No. 99 
Accession #: NM_00447 6 

Genomic sequence: chrll : 50361918-50423952 
zU Sequence definition: PSMA/FOLHl 

Sequence ID: No. 100 
Accession #: NM_00447 6 

Genomic sequence: chrll : 50361918-50423952 
Sequence definition: PSMA/FOLHl 

Sequence ID: No. 101 

Accession #: NM_00447 6 

Genomic sequence: chrll : 50361918-50423952 
oK) Sequence definition: PSMA/FOLHl 

Sequence ID: No, 102 
Accession #: no match to index 
Genomic sequence: No match BLAT 
35 Sequence definition: No match BLAT 

Sequence ID: No. 103 
Accession #: AC105101.8 

Genomic sequence: chrl8 : 45441503-45442177 
Sequence definition: genomic match 



40 



Sequence ID: No. 104 
Accession #: BC043509 

Genomic sequence: chr2 : 7566735-7567210 
Sequence definition: genomic match 

Sequence ID: No. 105 
Accession #: no match to index 
Genomic sequence: No match BLAT 
Sequence definition: No match BLAT 

Sequence ID: No. 106 
Accession #: NT_007914 . 34 5 

Genomic sequence: chr7 : 150965224-150965948 
o:> Sequence definition: Genscan prediction 



50 
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Sequence ID: No. 107 
Accession #: NM_002474 

Genomic sequence: chrl6: 15123743-15124024 

Sequence definition: smooth muscle myosin heavy chain 11 isoform 
Sequence ID: No. 108 
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Accession #: no match to index 
Genomic sequence: No match BLAT 
Sequence definition: No match BLAT 

5 Sequence ID: No. 109 

Accession #: 7U:.450472 • 14 

Genomic sequence: chrX : 132596913-132597349 
Sequence definition: genomic match 

10 Sequence ID: No. 110 

Accession #: no match to index 
Genomic sequence: No match BLAT 
Sequence definition: No match BLAT 

15 Sequence ID: No. Ill 

Accession #: NM_024 4 90 

Genomic sequence: chrl5 : 18 676827-18681314 
Sequence definition: ATPase Class V type lOA 

20 Sequence ID: No. 112 

Accession #: NT_007741.24 

Genomic sequence: chr7 : 154483727-154484200 
Sequence definition: Genscan prediction 

25 Sequence ID: No. 113 

Accession #: NT_010168.1 

Genomic sequence: chrl4 : 100136759-100137109 
Sequence definition: Genscan prediction 

30 Sequence ID: No. 114 
Accession #: AK074158 

Genomic sequence: chr7 : 2347770-2347996 

Sequence definition: Homo sapiens mRNA for FLJ00231 protein 

35 Sequence ID: No. 115 

Accession #: no match to index 
Genomic sequence: No match BLAT 
Sequence definition: No match BLAT 

40 Sequence ID: No. 116 
Accession #: AL54 9429 

Genomic sequence: chrll : 9027915-9028089 
Sequence definition: genomic match 

45 Sequence ID: No. 117 

Accession #: NM_015541 

Genomic sequence: chr3 : 658 99978-65900329 

Sequence definition: leucine-rich repeats and immunoglobulin-like 

50 Sequence ID: No. 118 

Accession #: NM_024897 

Genomic sequence: chrl : 151978744-151978881 
Sequence definition: hypothetical protein FLJ22672 

55 Sequence ID: No. 119 

Accession #: NM_006598 

Genomic sequence: chr5 : 1165896-1168793 

Sequence definition: solute carrier family 12 potassium/chloride 

60 Sequence ID: No. 120 

Accession #: NM_021569 

Genomic sequence: chr9 : 131740238-131740388 
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Sequence definition: NMDA receptor 1 isoform NRl-2 precursor 

Sequence ID: No. 121 
Accession #: AL445467.6 

Genomic sequence: chrX: 15985515-15985779 
Sequence definition: genomic match 

Sequence ID: No. 122 
Accession #: BM97 6799 

Genomic sequence: chrl : 54049149-54049432 
Sequence definition: genomic/EST match 

Sequence ID: No. 123 
Accession #: no match to index 
Genomic sequence: No match BLAT 
Sequence definition: No match BLAT 

Sequence ID: No. 124 

Accession #: NT_007933 . 414 

Genomic sequence: chr7 : 98285605-98286140 

Sequence definition: Genscan prediction 

Sequence ID: No. 125 
Accession #: NM_020428 

Genomic sequence: chrl9: 10964586-10965036 

Sequence definition: Homo sapiens CTL2 gene CTL2 mRNA 

Sequence ID: No. 126 
Accession #: no match to index 
Genomic sequence: No match BLAT 
Sequence definition: No match BLAT 

Sequence ID: No. 127 
Accession #: NM_006292 

Genomic sequence: chrll : 19444265-19444422 

Sequence definition: Homo sapiens tumor susceptibility gene 101 TSGlOl 

Sequence ID: No. 128 
Accession #: NM_052932 

Genomic sequence: chrll : 102306433-102306907 

fSSr^genf pSRmJS^n^r '"^'""^ Pro-oncosis receptor inducing xaen^rane 

Sequence ID: No. 129 
Accession #: NM_000014 

Genomic sequence: chrl2 : 9416444-9416720 

Sequence definition: Homo sapiens alpha-2-macroglobulin A2M mRNA 

Sequence ID: No. 130 
Accession #: NM_002337 

Genomic sequence: chr4 : 3426547-3433294 

Sequence definition: low density lipoprotein-related 

Sequence ID: No. 131 
Accession #: AL834445 

Genomic sequence: chr20 : 23304135-23304477 

Sequence definition: Homo sapiens mRNA; cDNA DKFZp761J109 

Sequence ID: No. 132 
Accession #: NM_004 986 

Genomic sequence: chrl4 : 4 9879277-49880762 
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Sequence definition: kinectin 1 

Sequence ID: No. 133 
Accession #: NM_024295 

Genomic sequence: chr8 : 124092754-124095061 
Sequence definition: hypothetical protein MGC3067 

Sequence ID: No. 134 
Accession #: AC018457.14 

Genomic sequence: chr3 : 165236534-165236724 
Sequence definition: genomic match 

Sequence ID: No, 135 
Accession #: NM_004753 

Genomic sequence: chrl : 12208898-12258427 

SDRrj;SNA''^^^"^''^°''' ^^P^^"^ short-chain dehydrogenase/reductase 

Sequence ID: No. 13 6 
Accession #: NM_004753 

Genomic sequence: chrl : 12221576-12258383 

SKrna'^^^^''^''^''''' ^^^^^^^ Short-chain dehydrogenase/reductase 

Sequence ID: No. 137 
Accession #: NM_004753 

Genomic sequence: chrl : 12221576-12258383 

ioRr^l^NA'^^^^''^^'^''''' ^^^^^"^ short-chain dehydrogenase/reductase 

Sequence ID: No. 138 
Accession #: NM_004753 

Genomic sequence: chrl : 12221576-12258383 

SS^SSSa'^^^^''^''^''''' '''''''' ^^^^^""^ short-chain dehydrogenase/reductase 

Sequence ID: No. 139 
Accession #: NM_004 753 

Genomic sequence: chrl : 12221576-12258383 

SDRrjlSNA'^^^^''^''^''''' ''''''^ ^^^^^""^ short-chain dehydrogenase/reductase 

Sequence ID: No. 140 
Accession #: D87438 

Genomic sequence: chrl6 : 14996279-15058862 

Sequence definition: Human mRNA for KIAA0251 gene partial cds 

Sequence ID: No. 141 
Accession #: D87438 

Genomic sequence: chrl6 : 15018972-15027737 

Sequence definition: Human mRNA for KIAA0251 gene partial cds 

Sequence ID: No. 142 
Accession #: AB007932 

Genomic sequence: chrl : 204843635-205060532 

Sequence definition: Homo sapiens plexin A2 long form PLXNA2 mRNA 

Sequence ID: No. 14 3 
Accession #: AB007932 

Genomic sequence: chrl : 204843635-205060532 

Sequence definition: Homo sapiens plexin A2 long form PLXNA2 mRNA 
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Sequence ID: No. 144 
Accession #: AB007932 

Genomic sequence: chrl : 204843635-205060532 
^ Sequence definition: Homo sapiens plexin A2 long form PLXNA2 mRNA 

Sequence ID: No. 14 5 
Accession #: AB037745 

Genomic sequence: chrl : 108833848-108851509 

Sequence definition: Homo sapiens mRNA for KIAA1324 protein partial cds 

Sequence ID: No. 14 6 
Accession #: AB037745 

Genomic sequence: chrl: 108851126-108851424 

Sequence definition: Homo sapiens mRNA for KIAA1324 protein partial cds 

Sequence ID: No. 147 

Accession #: AB037745 

Genomic sequence: chrl : 108851126-108851424 

Sequence definition: Homo sapiens mRNA for KIAA1324 protein partial cds 

Sequence ID: No. 148 
Accession #: AB037745 

Genomic sequence: chrl: 108851126-108851424 

Sequence definition: Homo sapiens mRNA for KIAA1324 protein partial cds 

Sequence ID: No. 14 9 
Accession #: AB037745 

Genomic sequence: chrl : 108851126-108851424 

Sequence definition: Homo sapiens mRNA for KIAA1324 protein partial cds 

Sequence ID: No. 150 
Accession #: NM_002253 

Genomic sequence: chr4 : 55795152-55795458 

35 ^f?'' sapiens kinase insert domain receptor a type 

^3 III receptor tyrosine kinase KDR mRNA 

Sequence ID: No. 151 
Accession #: NM_004879 

Genomic sequence: chrll : 125479160-125481382 
4U Sequence definition: Homo sapiens etoposide induced 2.4 mRNA EI24 mRNA 

Sequence ID: No. 152 
Accession #: BC041788 

Genomic sequence: chr8 : 14484144 9-144841809 
4!) Sequence definition: Homo sapiens Similar to RIKEN cDNA 1110025J15 aene 
clone MGC: 32881 IMAGE: 4738372 mRNA complete cds 1J.1U025J15 gene 

Sequence ID: No. 153 
Accession #: AB033073 
50 Genomic sequence: chr20 : 4 6925235-46925516 

Sequence definition: Homo sapiens mRNA for KIAA1247 protein partial cds 

Sequence ID: No. 154 
Accession #: NT_011520 . 136 
55 Genomic sequence: chr22 : 21548074-21562329 
Sequence definition: Genscan prediction 

Sequence ID: No. 155 
Accession #: NM_005581 
60 Genomic sequence: chrl9 : 49998069-49998792 

fSSlSeS LS^iS^A^"'"' sapiens Lutheran blood group Auberger b antigen 
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Sequence ID: No. 156 
Accession #: NM_004355 

Genomic sequence: chr5:149769000~149775442 

m^?nrh?o^^^^''^^i?w sapiens CD7 4 antigen invariant polypeptide of 
ma:or hxstocompatibxlity complex class II antigen-associaLd^D74 m^A 

Sequence ID: No. 157 
Accession #: NM_000484 
10 Genomic sequence: chr21 : 26174 980-26175131 

Sequence definition: Homo sapiens amyloid beta A4 precursor protein 
protease nexin-II Alzheimer disease APP itiRNA ^cursor protexn 

Sequence ID: No. 158 
15 Accession #: NM_00574 5 

Genomic sequence: chrX: 150566783-150575554 

Sequence definition: Homo sapiens accessory protein BAP31 BCAP31 mRNA 

Sequence ID: No. 159 
20 Accession #: NM_005570 

Genomic sequence: chrl8 : 56780509-56781078 

Sequence definition: Homo sapiens lectin mannose-binding 1 LMANl mRNA 

Sequence ID: No. 160 
25 Accession #: NT_029218.14 

Genomic sequence: chrl : 19080562-19080917 
Sequence definition: Genscan prediction 

Sequence ID: No. 161 
30 Accession #: NT_011387.8 

Genomic sequence: chr20 : 410654-410816 
Sequence definition: Genscan prediction 

Sequence ID: No. 162 
35 Accession #: NM_002587 

Genomic sequence: chr5 : 141227996-141231527 

Sequence definition: Homo sapiens protocadherin 1 PDCHl 

Sequence ID: No. 163 
40 Accession #: NT_035036.5 

Genomic sequence: chrlO : 51263955-51274232 
Sequence definition: Genscan prediction 

Sequence ID: No. 164 
45 Accession #: NM_007176 

Genomic sequence: chrl4 : 74107662-74107815 

Sequence definition: Homo sapien Chr 14 open reading frame 

Sequence ID: No. 165 
50 Accession #: AP000531.1 

Genomic sequence: chr22 : 14703272-14703359 
Sequence definition: poor genomic match to repeat 

Sequence ID: No. 166 
55 Accession #: NM_020182 

Genomic sequence: chr20 : 56850452-56936716 

^nSElA^^^Nf ^^''^ ^^^^^^^ transmembrane prostate androgen induced 

60 Sequence ID: No, 167 

Accession #: NM_020182 

Genomic sequence: chr20:56850452-56936716 
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SnSElAf^nSi''""' ^^^"-^"^ transmembrane prostate androgen induced 

Sequence ID: No. 168 
Accession #: NM_020182 

Genomic sequence: chr20 : 56850452-56936716 

?Nr?SE?A?^iSf """'^ transmembrane prostate androgen induced 

Sequence ID: No. 169 
Accession #: NM_020182 

Genomic sequence: chr20 : 56850452-56936716 

ISfTS^kf SSi''^''''' ^^^^^^^ transmembrane prostate androgen induced 

Sequence ID: No. 170 

Accession #: NM_020182 

Genomic sequence: chr20 : 56850452-56936716 

RSA^?MEL?SSi'''^°''' ^^^^^"^ transmembrane prostate androgen induced 

Sequence ID: No. 171 
Accession #: NM_020182 

Genomic sequence: chr20 : 56850452-56936716 

SnS^^A? SSi''^^''' ^^^^^""^ transmembrane prostate androgen induced 

Sequence ID: No. 172 
Accession #: NM_020182 

Genomic sequence: chr20 : 56850452-56936716 

RNnS^lA^^RS''^''''' ^^^^^""^ transmembrane prostate androgen induced 

Sequence ID: No. 173 
Accession #: AK092 666_01 

Sequence definition: Novel spliced isoform of STEAP2 

Sequence ID: No. 174 
Accession #: AK092666_01 

ST^tT^ definition: Protein translation of novel spliced isoform of 

Sequence ID: No. 175 
Accession #: AK092666_02 

Sequence definition: Novel spliced isoform of STEAP2 

Sequence ID: No. 17 6 
Accession #: AK092 666_02 

STmlT^ definition: Protein translation of novel spliced isoform of 

Sequence ID: No. 177 
Accession #: AK092666_03 

Sequence definition: Novel spliced isoform of STEAP2 

Sequence ID: No. 178 
Accession #: AK092666_03 

S?E^ir^ definition: Protein translation of novel spliced isoform of 

Sequence ID: No. 179 
Accession #: AK092666 04 
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Sequence definition: Novel spliced isoform of STEAP2 

Sequence ID: No. 180 
Accession #: AK092666_04 

S?Sir^ ^^^^''^^^^"^ Protein translation of novel spliced isoform of 

Sequence ID: No, 181 
Accession #: AK092666_05 

Sequence definition: Novel spliced isoform of STEAP2 

Sequence ID: No. 182 
Accession #: AK092666 05 



Sequence definition: Novel spliced isoform of STEAP2 

Sequence ID: No. 183 
Accession #: AK092666_01aa 

i;?s;reij?f'ssi?s^e5?r^^sSi,i3f ^^^^^^ "o*-^ 

Sequence ID: No. 184 

Accession #: AK092666_02aa 

SaSel 2f Novel amino acids generated by spliced isoform 

25 

Sequence ID: No. 185 

Accession #: AK092666_04aa 

AK092666 ^^^^^^ ^=^^3 generated by spliced isoform 

30 
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